Gene signature of electroshock therapy and methods of use

ABSTRACT

Gene signatures associated with electroconvulsive therapy (ECT) and/or electroconvulsive seizures (ECS) are disclosed. Such “signatures” are useful, e.g., for diagnosing and treating neuropsychiatric disorders such as major depressive disorder (MDD), bipolar affective disorder (BAD) and psychotic depression. Such methods are therefore also provided here. The invention additionally provides screening methods that use gene signatures of the invention to identify new therapeutic compounds for treating these disorders.

1. CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] Priority is claimed under 35 U.S.C. 119(e) to U.S. provisional application Serial Nos. 60/411,718; 60/431,882; and 60/479,970 filed on Sep. 18, 2002; Dec. 9, 2002; and Jun. 18, 2003, respectively. The contents of these priority applications are hereby incorporated by reference in their entireties.

2. FIELD OF THE INVENTION

[0002] The invention relates to methods and compositions for the diagnosis and treatment of neuropsychiatric disorders, including major depressive disorder (MDD), bipolar affective disorder (BAD), schizophrenia and psychotic depression. More specifically, the invention relates to gene signatures, that are associated with electroshock therapy and are useful for diagnosing and treating such neuropsychiatric disorders.

3. BACKGROUND OF THE INVENTION

[0003] Current treatments for the psychiatric depression, such as that associated with major depressive disorder (MDD), bipolar affective disorder (BAD), and psychotic depression, comprise several classes of antidepressant drugs. Unfortunately, many patients with these forms of depression fail to respond to such drug therapies. In such cases, electroconvulsive therapy (ECT) may be the only avaiable treatment. Indeed, ECT does produce a high level of success in patients where other pharmacological treatment regimens have failed.

[0004] Psychiatric depression associated with conditions such as major depressive disorder MDD) and psychotic depression are treated with several class of antidepressant drugs, all of which require several weeks to be effective. Unfortunately, many people do not respond to these drug therapies. In such cases, electroconvulsive therapy (ECT) remains the treatment. Although severe, ECT is recognized as a highly effective and rapid treatment, and it produces a high degree of success in patients where other therapeutic regimens have failed.

[0005] Receptor interactions and other pharmacological actions of many chemical antidepressants have been well studied and are characterized to at least some degree. By contrast, the mechanism or mechanisms of ECT remain poorly understood. Molecular effects of electroconvulsive seizure (ECS) in the brains of animals have been studied, as a model for ECT (for a review, see Fochtmann, Psychopharmacology Bulletin 1994, 30:321-444). The effects of ECS are divers, and include increases in levels of neurotransmitters, neuropeptides and synaptic remodeling, including sprouting. Id. ECS also afects several brain regions, including the hippocampus, frontal cortex, neostriatum, entorhinal cortex, temporal-parietal cortex, and monoaminergic nuclei that project to these areas. A summary of the effects reported in these regions is provided in a Table, attached as Appendix to this specification (see, Section 8.1, infra). These changes include, inter alia: increases in tyrosine kydroxylase in the locus coeruleus and diverse monoamine nerve terminal regions (Masserano et al., Science 1981, 214:662-665); decreases in β-adrenergic receptors, typically in the dentate gyrus of the hippocampus and in the frontal cortex (Biegon & Israeli, Eur. J. Pharmacol. 1986, 123:329-334); and increases of neurotrophic factors which can positively modulate monoaminergic neurotransmission (Altar et al., J. Neurochem. 1994, 63:1021-1032; Mamounas et al., J. Neurosci. 1995, 15:7929-7939; Martin-Iverson et al., J. Neurosci. 1994, 14:1262-1270; Siuciak et al., Pharmacol. Biochem. Behav. 1997, 56:131-137). Others have reported that the mRNA for brain-derived neurotrophic factor (BDNF) is upregulated in seizures (Isackson et al., Neuron 1991, 6:937-948; Nibuya et al., J. Neurosci. 1995, 15:7539-7547; Rocamora et al., Brain Res. Mol. Brain Res. 1992, 13:27-33).

[0006] Yet, the above-mentioned changes as well as those summarized in the Appendix, infra, were identified by investigations that examined changes of only very limited numbers of mRNA species in any brain region. No large-scale gene analysis following ECS has been reported. Hendriksen et al., (Eur. J. Neurosci 2001, 14:1475-1484) have described measured changes in the expression of multiple genes in the hippocampus, but only eight days after local seizures induced by electrical stimulation had been terminated by pentobarbital. Lukasiuk et al. (Eur. J. Neurosci. 2003, 17:271-279) report experiments where hippocampal and temporal lobe expression profiling was conducted in animals subject to amygdala kindling, but only before they developed spontaneous seizures.

[0007] The citation and/or discussion of a reference in this section and throughout the specification is provided merely to clarify the description of the present invention and is not an admission that any such reference is “prior art” to the invention described herein.

[0008] 4. SUMMARY OF THE INVENTION

[0009] The present invention overcomes problems, such as those described supra, by providing ECS signature genes—i.e., nucleic acids that are differentially expressed in individuals undergoing an ECS treatment or in subjects undergoing electroconvulsive therapy (ECT). Preferred ECS signature genes of the invention are set forth in the tables at the appendix, infra. See, in particular, in Sections 8.3-8.5 below.

[0010] The invention also provides ECS gene signatures, which represent collections of one or more ECS signature genes and, preferably, expression levels of these genes which are indicative of ECS or ECT. In additiona, the invention also provides methods and algorithms for identifying ECS signature genes and for selecting preferred ECS signature genes, e.g., for use in an ECS gene signature.

[0011] ECS genes signatures of the invention, and the ECS signature genes they comprise, are useful, e.g., for diagnostic and prognostic methods of identifying people who have or who are susceptible to neuropsychiatric disorders, including inter alia major depressive disorder (MDD), bipolar affective disorder (BAD) and psychotic depression. Such methods therefore are also provided and considered a part of the present invention. The invention additionally provides therapeutic methods, which use ECS gene signatures and ECS signature genes to treat, ameliorate or prevent such neuropsychiatric disorders. The invention further comprises screening methods, including MPHTS(G methods, that use ECS gene signatures and/or ECS signature genes to identify compounds that are useful in such methods for treating neuropsychiatric disorders.

[0012] The methods of the invention include methods for identifying compounds for treating neuropsychiatric disorders, such as schizophrenia, autism, MDD, BAD, schizophrenia and psychotic depression. These methods generally comprise steps of contacting a cell or cells with a test compound, and determining expression of one or more signature genes by the cell or cells. These signature genes typicaly comprise nucleic acids that hybridize to a nucleic acid selected from the group consisting of SEQ ID NOS:1-152 and the complements thereof. The determined expression of these one or more signature genes is compared to expression of those signature genes in a cell or cell that is not contacted with the test compound. Changes in expression of the one or more signature genes (compared to expression in cells not contacted with the test compound) indicate that the compound is useful for treating the neuropsychiatric disorder.

[0013] The invention additionally provides methods for identifying and selecting ECS signature genes. In particular, the invention provides methods for selecting one or more nucleic acids that are indicative of an effective therapy for treating a neuropsychiatric disorder. The method comprises identifying nucleic acids that are differentialy expressed in an individual subjected to electroconvulsive seizure (ECS) compared to an individual not subjected to ECS. In various aspects of these methods, the individual may be subjected to either actue ECS or chronic ECS, and the nucleic acids identified by this method are preferably ones expressed in the brain or in a region of the brain (preferably the frontal cortex or the hippocampus). In preferred embodiments, a score value is obtained for each of the identified nucleic acids. The score value is preferably a function of each gene's differential expression in individuals subjected to ECS and can be determined according to an objective method or algorithm, such as the method set forth in Section 8.2 (infra) for selecting ECS signature genes that are most informative and therefore most useful for the assays and other methods of this invention.

[0014] The invention further provides kits for detecting an ECS gene signature. Kits of the invention generally comprise a plurality of oligonucleotides, each of which is capable of specifically hybridizing to a different ECS signature gene. For example, in preferred embodiments the oligonucleotides in such kits are capable of specifically hybridizng to ECS signature genes selected from the genes set forth in the appendix, infra (see, in particular, at Sections 8.3-8.5 below), such as SEQ ID NOS:1-152, as well as homologous and complementary sequences thereof. Preferred kits of the invention may comprise, for example, oligonucleotide probes that are immobilized on a solid surface or support, such as in an expression array. In other embodiments, kits of the invention may comprise a plurality of oligonucleotide primers, more preferably a plurality of primer pairs, wherein each pair is capable of amplifying a particular ECS signature gene (for example in a PCR reaction). Kits of the invention may additionally comprise oligonucleotide primers that are capable of priming reverse transcription reactions, for generating cDNA from mRNAs of one or more ECS signature genes. In still other embodiments, kits of the invention may contain other reagents including, for example, nucleotides that are detectably labeled (e.g., for detecting the amplification and or hybridization of ECS signature genes), polymerases, and/or buffers.

[0015] Preferred kits of the invention comprise oligonucleotides that are capable of specifically hybridizing to at least five, and more preferably to at least 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150 or more signature genes. In other embodiments, a kit of the invention may comprise oligonucleotides that specifically hybridize to a set of ECS signature genes consisting of 150, 140, 130, 120, 110, 100, 90, 80, 70, 60, 50, 45, 40, 35, 30, 25, 20, 15, 10, 5 or fewer ECS signature genes of the invention.

[0016] 5. BRIEF DESCRIPTION OF THE DRAWINGS

[0017] FIGS. 1A-1D plot the mean abundances of each of the 8,799 probe sets measured with the Affymetrix U34A rodent chip in the frontal cortex or hippocampus of rats exposed to either acute or chronic ECS (y-axes) compared to mean abundances of those same probe sets in a sham control group α-axes). Genes denoted by white points on each of the plots are ones passed minimum expression restrictions (described in Section 7.1, infra) whose expression changed (in ECS rats compared to sham controls) by either of 1.5 or more or of 0.67 or less, and having a p values of less than 0.05. FIG. 1A plots gene expression levels measured in the frontal cortex of rats exposed to acute ECS compared to sham controls. FIG. 1B plots gene expression levels measured in the frontal cortex of rats exposed to chronic ECS compared to sham controls. FIG. 1C plots gene expression levels measured in the hippocampus of rats exposed to acute ECS compared to sham controls, and FIG. 1D plots gene expression levels measured in the hippocampus of rats exposed to chronic ECS compared to sham controls.

[0018]FIG. 2 shows a plot of a statistical evaluation demonstrating the effect of sample size on the number of identified gene changes or “hits”—i.e., measured changes in expression with p-values less than 0.01, regardless of the absolute magnitude of change in expression level. Groups of 2-10 rats were randomly composed from data of gene expression levels in the hippocampus of sham-treated (“sham”) and/or acute ECS-treated (“ECS”) rats.

[0019] The number of hits was calculated for each microarray analysis of these samples, and the process was repeated 100 times for each sample size. Values plotted in FIG. 2 are mean values ±1 standard deviation (SD). In the “mixed vs. mixed” condition, samples were randomly selected from all acute hippocampus samples, regardless of the treatment. The “theoretical chance” number of 40 gene changes is based on a p<0.01 for the 4,000 probes detected in the frontal cortex with at least one present call and a mean abundance of >100.

[0020]FIG. 3 shows a Venn diagram representation of the distribution of changes in 150 unique probe sets (of which 135 represent unique genes) in the frontal cortex and hippocampus after acute and chronic ECS. The total number of genes whose measured expression levels changed in each tissue are indicated in parentheses.

[0021] FIGS. 4A-4D show plots of the measured magnitude in expression levels for genes that were differentially expressed in the frontal cortex or hippocampus of rats treated with either acute or chronic ECS. FIGS. 4A and 4B plot changes in expression levels measured in the frontal cortex of rats treated with chronic or acute ECS, respectively, compared to sham-treated controls. FIGS. 4C and 4D plot changes in expression levels measured in the hippocampus of rats treated with chronic and acute ECS, respectively, compared to sham-treated control animals. For each gene whose expression changed (p<0.05) in treated animals compared with sham-treated control animals, the ratio of mean expression in the ECS group over the control group is plotted in order of the magnitude of changed. Decreases in the expression level of genes are plotted below the unity line, whereas increases in expression are plotted above the unity line. X-axes in all four plots are aligned and plotted at the same scale so that the width of the graph represents the number of genes affected in each direction.

[0022] 6. DETAILED DESCRIPTION

[0023] To date, the identification of therapeutic compounds to treat neuropsychiatric disorders has depended almost entirely on serendipity. That is to say, effective drugs and other therapies for such disorders have traditionally been discovered by chance and not as the result of any directed systematic screening method. Indeed, the complex polygenetic nature of neuropsychiatric disorders, the subtle structural and cellular changes that they entail, and the difficulties in diagnosing and monitoring these disorders have made traditional drug screening methods extremely difficult if not impracticable. U.S. patent application Publication No. 2003/0096264 by Altar et al. describes screening methods, referred to in that application as “Multi-Parameter High Throughput Screening” or “MPHTS,” that overcome many of these difficulties and are ideally suited for identifying effective and/or promising therapeutic compounds to treat neuropsychiatric disorders, including schizophrenia, bipolar affective disorder (BAD), autism, major depressive disorder (MDD) and psychotic depression to name a few.

[0024] Briefly, the MPHTS approach pertains to the combination of data generated from gene expression profiling coupled with methods for the systematic analysis and/or employment of such data. Using the MPHTS methods described herein, large numbers of candidate compounds may be screened (e.g., in vitro) to identify ones that are particularly promising (and, as such, most likely to be suitable) for treating a neuropsychiatric disorder in vivo (e.g., in an individual). For descriptive purposes, these assays comprise at least two tiers. The first tier involves the determination of genes involved in a particular disorder, which is preferably a neuropsychiatric disorder. However, in another embodiment which is actually preferred here, the genes identified may be ones that are associated with a particular model for a neuropsychiatric or other disorder. For instance, the examples infra describe experiments that use electroconvulsive seizures (ECS) in rats as a model for electroconvulsive therapy.

[0025] The second tier of an MPHTS method involves the implementation of systematic methods to screen test compounds. Such screening methods may be either existing assay platforms that are already known in the art, such as those described in U.S. patent aplication Publication No. 2003/0096264, or they may be novel assays that are described here for the first time. Preferably, however, the screening assays will be automated and/or will be high-throughput assays, so that a large number of test compounds can be rapidly screened with a minimal amount of labor and effort.

[0026] 6.1. Definitions

[0027] The terms used in this specification generally have their ordinary meanings in the art, withint the context of this invention and in the specific context where each term is used. Certain terms are discussed below, or else in the specification, to provide additional guidance to the practitioner in describing the compositions and methods of this invention and how they may be made and used. In accordance with the invention, there may be employed conventional techniques in the fields of molecular biology, microbiology, and recombinant DNA technology. These techniques are explained more fully in the literature and are within the skill of the art. See, for example, Sambrook, Fitsch & Maniatis, Molecular Cloning: A Laboratory Manual, Second Edition (1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (referred to as “Sambrook et al., 1989”); DNA Cloning: A Practical Approach, Volumes I and II (D.N. Glover ed. 1985); Oligonucleotide Synthesis (M. J. Gait ed. 1984); Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins, eds. 1984); Animal Cell Culture (R. I. Freshney, ed. 1986); Immobilized Cells and Enzymes (IRL Press, 1986); B. E. Perbal, A Practical Guide to Molecular Cloning (1984); F. M. Ausubel et al. (eds.), Current Protocols in Molecular Biology, John Wiley & Sons, Inc. (1994). The term “neuropsychiatric disorder”, which may also be referred to as a “major mental illness disorder” or “major mental illness”, refers to a disorder which may be generally characterized by one or more breakdowns in the adaptation process. Such disorders are therefore expressed primarily in abnormalities of thought, feeling and/or behavior producing either distress or impairment of function (i.e., impairment of mental function such as with dementia or senility). Currently, individuals may be evaluated for various neuropsychiatric disorders using criteria set forth in the most recent version of the American Psychiatric Association's Diagnostic and Statistical Manual of Mental Health (DSM-IV). Exemplary neuropsychiatric disorders include, but are not limited to, schizophrenia, attention deficit disorder (ADD), schizoaffective disorder, bipolar affective disorder, unipolar affective disorder, and adolescent conduct disorder.

[0028] As used herein, the term “isolated” means that the referenced material is removed from the environment in which it is normally found. Thus, an isolated biological material can be free of cellular components; i.e., components of the cells in which the material is found or produced. In the case of nucleic acid molecules, an isolated nucleic acid includes a PCR product, an isolated mRNA, a cDNA, or a restriction fragment. In another embodiment, an isolated nucleic acid is preferably excised from the chromosome in which it may be found, and more preferably is no longer joined to non-regulatory, non-coding regions, or to other genes, located upstream or downstream of the gene contained by the isolated nucleic acid molecule when found in the chromosome. In yet another embodiment, the isolated nucleic acid lacks one or more introns. Isolated nucleic acid molecules include sequences inserted into plasmids, cosmids, artificial chromosomes, and the like. Thus, in a specific embodiment, a recombinant nucleic acid is an isolated nucleic acid. An isolated protein may be associated with other proteins or nucleic acids, or both, with which it associates in the cell, or with cellular membranes if it is a membrane-associated protein. An isolated organelle, cell, or tissue is removed from the anatomical site in which it is found in an organism. An isolated material may be, but need not be, purified.

[0029] The term “purified” as used herein refers to material that has been isolated under conditions that reduce or eliminate the presence of unrelated materials, i.e., contaminants, including native materials from which the material is obtained. For example, a purified protein is preferably substantially free of other proteins or nucleic acids with which it is associated in a cell; a purified nucleic acid molecule is preferably substantially free of proteins or other unrelated nucleic acid molecules with which it can be found within a cell. As used herein, the term “substantially free” is used operationally, in the context of analytical testing of the material. Preferably, purified material substantially free of contaminants is at least 50% pure; more preferably, at least 90% pure, and more preferably still at least 99% pure. Purity can be evaluated by chromatography, gel electrophoresis, immunoassay, composition analysis, biological assay, and other methods known in the art.

[0030] Methods for purification are well-known in the art. For example, nucleic acids can be purified by precipitation, chromatography (including preparative solid phase chromatography, oligonucleotide hybridization, and triple helix chromatography), ultracentrifugation, and other means. Polypeptides and proteins can be purified by various methods including, without limitation, preparative disc-gel electrophoresis, isoelectric focusing, HPLC, reversed-phase HPLC, gel filtration, ion exchange and partition chromatography, precipitation and salting-out chromatography, extraction, and countercurrent distribution. For some purposes, it is preferable to produce the polypeptide in a recombinant system in which the protein contains an additional sequence tag that facilitates purification, such as, but not limited to, a polyhistidine sequence, or a sequence that specifically binds to an antibody, such as FLAG and GST. The polypeptide can then be purified from a crude lysate of the host cell by chromatography on an appropriate solid-phase matrix. Alternatively, antibodies produced against the protein or against peptides derived therefrom can be used as purification reagents. Cells can be purified by various techniques, including centrifugation, matrix separation (e.g., nylon wool separation), panning and other immunoselection techniques, depletion (e.g., complement depletion of contaminating cells), and cell sorting (e.g., fluorescence activated cell sorting or “FACS”). Other purification methods are possible. A purified material may contain less than about 50%, preferably less than about 75%, and most preferably less than about 90%, of the cellular components with which it was originally associated. The “substantially pure” indicates the highest degree of purity which can be achieved using conventional purification techniques known in the art.

[0031] A “sample” as used herein refers to a biological material which can be tested, e.g., for the presence of one or more polypeptide or nucleic acids. For example, in one embodiment, a sample is a sample of nucleic acids from a cell (e.g., mRNA, or nucleic acids derived therefrom) and is tested or analyzed for the presence or absence of certain particular nucleic acid sequences, corresponding to certain genes that may be expressed by the cell. Such samples can be obtained from any source, including tissue, blood and blood cells, including circulating hematopoietic stem cells (for possible detection of protein or nucleic acids), plural effusions, cerebrospinal fluid (CSF), ascites fluid, and cell culture.

[0032] Non-human animals include, without limitation, laboratory animals such as mice, rats, rabbits, hamsters, guinea pigs, etc.; domestic animals such as dogs and cats; and, farm animals such as sheep, goats, pigs, horses, and cows. A non-human animal of the present invention may be a mammalian or non-mammalian animal; a vertebrate or an invertebrate.

[0033] In preferred embodiments, the terms “about” and “approximately” shall generally mean an acceptable degree of error for the quantity measured given the nature or precision of the measurements. Typical, exemplary degrees of error are within 20 percent (%), preferably within 10%, and more preferably within 5% of a given value or range of values. Alternatively, and particularly in biological systems, the terms “about” and “approximately” may mean values that are within an order of magnitude, preferably within 5-fold and more preferably within 2-fold of a given value. Numerical quantities given herein are approximate unless stated otherwise, meaning that the term “about” or “approximately” can be inferred when not expressly stated.

[0034] The term “molecule” means any distinct or distinguishable structural unit of matter comprising one or more atoms, and includes, for example, polypeptides and polynucleotides.

[0035] The term “aberrant” or “abnormal”, as applied herein refers to an activity or feature which differs from a normal or activity or feature, or to an activity or feature which is within normal variations of a standard value.

[0036] For example, an abnormal activity of a gene or protein refers to an activity which differs from the activity of the wild-type or native gene or protein, or which differs from the activity of the gene or protein in a healthy subject. An activity of a gene includes, for instance, the transcriptional activity of the gene which may result from, e.g., an aberrant promoter activity. Such an abnormal transcriptional activity can result, e.g., from one or more mutations in a promoter region, such as in a regulatory element thereof. An abnormal transcriptional activity can also result from a mutation in a transcription factor involved in the control of gene expression.

[0037] An activity of a protein can be aberrant because it is stronger than the activity of its native counterpart. Alternatively, an activity can be aberrant because it is weaker or absent related to the activity of its native counterpart. An aberrant activity can also be a change in an activity. For example an aberrant protein can interact with a different protein relative to its native counterpart. A cell can have an aberrant activity due to overexpression or underexpression of a gene or protein. An aberrant activity can result, e.g., from a mutation in the gene, which results, e.g., in lower or higher binding affinity of a ligand or substrate to the protein encoded by the mutated gene.

[0038] The term “therapeutically effective dose” refers to that amount of a compound or compositions that is sufficient to result in a desired activity.

[0039] The phrase “pharmaceutically acceptable” refers to molecular entities and compositions that are physiologically tolerable and do not typically produce an allergic or similar untoward reaction (for example, gastric upset, dizziness and the like) when administered to an individual. Preferably, and particularly where a pharmaceutical composition is used in humans, the term “pharmaceutically acceptable” may mean approved by a regulatory agency (for example, the U.S. Food and Drug Agency) or listed in a generally recognized pharmacopeia for use in animals (for example, the U.S. Pharmacopeia).

[0040] The term “carrier” refers to a diluent, adjuvant, excipient, or vehicle with which a compound is administered. Sterile water or aqueous saline solutions and aqueous dextrose and glycerol solutions are preferably employed as carriers, particularly for injectable solutions. Exemplary suitable pharmaceutical carriers are described in “Reminington's Pharmaceutical Sciences” by E. W. Martin.

[0041] The term “polymer” means any substance or compound that is composed of two or more building blocks (‘mers’) that are repetitively linked together. For example, a “dimer” is a compound in which two building blocks have been joined together; a “trimer” is a compound in which three building blocks have been joined together; etc.

[0042] The term “polynucleotide” or “nucleic acid molecule” as used herein refers to a polymeric molecule having a backbone that supports bases capable of hydrogen bonding to typical polynucleotides, wherein the polymer backbone presents the bases in a manner to permit such hydrogen bonding in a specific fashion between the polymeric molecule and a typical polynucleotide (e.g., single-stranded DNA). Such bases are typically inosine, adenosine, guanosine, cytosine, uracil and thymidine. Polymeric molecules include “double stranded” and “single stranded” DNA and RNA, as well as backbone modifications thereof (for example, methylphosphonate linkages).

[0043] Thus, a “polynucleotide” or “nucleic acid” sequence is a series of nucleotide bases (also called “nucleotides”), generally in DNA and RNA, and means any chain of two or more nucleotides. A nucleotide sequence frequently carries genetic information, including the information used by cellular machinery to make proteins and enzymes. The terms include genomic DNA, cDNA, RNA, any synthetic and genetically manipulated polynucleotide, and both sense and antisense polynucleotides. This includes single- and double-stranded molecules; i.e., DNA-DNA, DNA-RNA, and RNA-RNA hybrids as well as “protein nucleic acids” (PNA) formed by conjugating bases to an amino acid backbone. This also includes nucleic acids containing modified bases, for example, thio-uracil, thio-guanine and fluoro-uracil.

[0044] The polynucleotides herein may be flanked by natural regulatory sequences, or may be associated with heterologous sequences, including promoters, enhancers, response elements, signal sequences, polyadenylation sequences, introns, 5′- and 3′-non-coding regions and the like. The nucleic acids may also be modified by many means known in the art. Non-limiting examples of such modifications include methylation, “caps”, substitution of one or more of the naturally occurring nucleotides with an analog, and internucleotide modifications such as, for example, those with uncharged linkages (e.g., methyl phosphonates, phosphotriesters, phosphoroamidates, carbamates, etc.) and with charged linkages (e.g., phosphorothioates, phosphorodithioates, etc.). Polynucleotides may contain one or more additional covalently linked moieties, such as proteins (e.g., nucleases, toxins, antibodies, signal peptides, poly-L-lysine, etc.), intercalators (e.g., acridine, psoralen, etc.), chelators (e.g., metals, radioactive metals, iron, oxidative metals, etc.) and alkylators to name a few. The polynucleotides may be derivatized by formation of a methyl or ethyl phosphotriester or an alkyl phosphoramidite linkage. Furthermore, the polynucleotides herein may also be modified with a label capable of providing a detectable signal, either directly or indirectly. Exemplary labels include radioisotopes, fluorescent molecules, biotin and the like. Other non-limiting examples of modification which may be made are provided, below, in the description of the present invention.

[0045] Specific non-limiting examples of synthetic nucleic acids envisioned for this invention include, in addition to the nucleic acid moieties described above, nucleic acids that contain phosphorothioates, phosphotriesters, methyl phosphonates, short chain alkyl, or cycloalkyl intersugar linkages or short chain heteroatomic or heterocyclic intersugar linkages. Most preferred are those with CH₂—NH—O—CH₂, CH₂—N(CH₃)—O—CH₂, CH₂—O—N(CH₃)—CH₂, CH₂—N(CH₃)—N(CH₃)—CH₂ and O—N(CH₃)—CH₂—CH₂ backbones (where phosphodiester is O—PO₂—O—CH₂). U.S. Pat. No. 5,677,437 describes heteroaromatic nucleic acid linkages. Nitrogen linkers or groups containing nitrogen can also be used to prepare nucleic acid mimics (U.S. Pat. Nos. 5,792,844 and 5,783,682). U.S. Pat. No. 5,637,684 describes phosphoramidate and phosphorothioamidate oligomeric compounds. Also envisioned are nucleic acids having morpholino backbone structures (U.S. Pat. No. 5,034,506). In other embodiments, such as the peptide-nucleic acid (PNA) backbone, the phosphodiester backbone of the nucleic acid may be replaced with a polyamide backbone, the bases being bound directly or indirectly to the aza nitrogen atoms of the polyamide backbone (Nielsen et al., Science 254:1497, 1991). Other synthetic nucleic acids may contain substituted sugar moieties comprising one of the following at the 2′ position: OH, SH, SCH₃, F, OCN, O(CH₂)_(n)NH₂ or O(CH₂)_(n)CH₃ where n is from 1 to about 10; C₁ to C₁₀ lower alkyl, substituted lower alkyl, alkaryl or aralkyl; Cl; Br; CN; CF₃; OCF₃; O-; S-, or N-alkyl; O-, S-, or N-alkenyl; SOCH₃; SO₂CH₃; ONO₂;NO₂; N₃; NH₂; heterocycloalkyl; heterocycloalkaryl; aminoalkylamino; polyalkylamino; substituted silyl; a fluorescein moiety; an RNA cleaving group; a reporter group; an intercalator; a group for improving the pharmacokinetic properties of a nucleic acid; or a group for improving the pharmacodynamic properties of an nucleic acid, and other substituents having similar properties. Nucleic acids may also have sugar mimetics such as cyclobutyls or other carbocyclics in place of the pentofuranosyl group. Nucleotide units having nucleosides other than adenosine, cytidine, guanosine, thymidine and uridine, such as inosine, may be used in an oligonucleotide molecule.

[0046] The term “oligonucleotide” refers to a nucleic acid, generally of at least 10, preferably at least 15, and more preferably at least 20 nucleotides, preferably no more than 100 nucleotides, that is hybridizable to a genomic DNA molecule, a cDNA molecule, or an mRNA molecule encoding a gene, mRNA, cDNA, or other nucleic acid of interest. Oligonucleotides can be labeled, e.g., with ³²P-nucleotides or nucleotides to which a label, such as biotin or a fluorescent dye (for example, Cy3 or Cy5) has been covalently conjugated. In one embodiment, a labeled oligonucleotide can be used as a probe to detect the presence of a nucleic acid. In another embodiment, oligonucleotides (one or both of which may be labeled) can be used as PCR primers, either for cloning full length or a fragment of a gene, or to detect the presence of nucleic acids encoding a particular gene product (e.g., to detect the presence of a particular mRNA). In a further embodiment, an oligonucleotide of the invention can form a triple helix. Generally, oligonucleotides are prepared synthetically, preferably on a nucleic acid synthesizer. Accordingly, oligonucleotides can be prepared with non-naturally occurring phosphoester analog bonds, such as thioester bonds, etc.

[0047] A “polypeptide” is a chain of chemical building blocks called amino acids that are linked together by chemical bonds called “peptide bonds”. The term “protein” refers to polypeptides that contain the amino acid residues encoded by a gene or by a nucleic acid molecule (e.g., an mRNA or a cDNA) transcribed from that gene either directly or indirectly. Optionally, a protein may lack certain amino acid residues that are encoded by a gene or by an mRNA. For example, a gene or mRNA molecule may encode a sequence of amino acid residues on the N-terminus of a protein (i.e., a signal sequence) that is cleaved from, and therefore may not be part of, the final protein. A protein or polypeptide, including an enzyme, may be a “native” or “wild-type”, meaning that it occurs in nature; or it may be a “mutant”, “variant” or “modified”, meaning that it has been made, altered, derived, or is in some way different or changed from a native protein or from another mutant.

[0048] A “ligand” is, broadly speaking, any molecule that binds to another molecule. In preferred embodiments, the ligand is either a soluble molecule or the smaller of the two molecule or both. The other molecule is referred to as a “receptor”. In preferred embodiments, both a ligand and its receptor are molecules (preferably proteins or polypeptides) produced by cells. Preferably, a ligand is a soluble molecule and the receptor is an integral membrane protein (i.e., a protein expressed on the surface of a cell). The binding of a ligand to its receptor is frequently a step of signal transduction within a cell. Exemplary ligand-receptor interactions include, but are not limited to, binding of a hormone to a hormone receptor (for example, the binding of estrogen to the estrogen receptor) and the binding of a neurotransmitter to a receptor on the surface of a neuron.

[0049] “Amplification” of a polynucleotide, as used herein, denotes the use of polymerase chain reaction (PCR) to increase the concentration of a particular DNA sequence within a mixture of DNA sequences. For a description of PCR see Saiki et al., Science 1988, 239:487.

[0050] “Chemical sequencing” of DNA denotes methods such as that of Maxam and Gilbert (Maxam-Gilbert sequencing; see Maxam & Gilbert, Proc. Natl. Acad. Sci. U.S.A. 1977, 74:560), in which DNA is cleaved using individual base-specific reactions.

[0051] “Enzymatic sequencing” of DNA denotes methods such as that of Sanger (Sanger et al., Proc. Natl. Acad. Sci. U.S.A. 1977, 74:5463) and variations thereof well known in the art, in a single-stranded DNA is copied and randomly terminated using DNA polymerase.

[0052] A “gene” is a sequence of nucleotides which code for a functional “gene product”. Generally, a gene product is a functional protein. However, a gene product can also be another type of molecule in a cell, such as an RNA (e.g., a tRNA or a rRNA). For the purposes of the present invention, a gene product also refers to an mRNA sequence which may be found in a cell. For example, measuring gene expression levels according to the invention may correspond to measuring mRNA levels. A gene may also comprise regulatory (i.e., non-coding) sequences as well as coding sequences. Exemplary regulatory sequences include promoter sequences, which determine, for example, the conditions under which the gene is expressed. The transcribed region of the gene may also include untranslated regions including introns, a 5′-untranslated region (5′-UTR) and a 3′-untranslated region (3′-UTR).

[0053] A “coding sequence” or a sequence “encoding” an expression product, such as a RNA, polypeptide, protein or enzyme, is a nucleotide sequence that, when expressed, results in the production of that RNA, polypeptide, protein or enzyme; i.e., the nucleotide sequence “encodes” that RNA or it encodes the amino acid sequence for that polypeptide, protein or enzyme.

[0054] A “promoter sequence” is a DNA regulatory region capable of binding RNA polymerase in a cell and initiating transcription of a downstream (3′ direction) coding sequence. For purposes of defining the present invention, the promoter sequence is bounded at its 3′ terminus by the transcription initiation site and extends upstream (5′ direction) to include the minimum number of bases or elements necessary to initiate transcription at levels detectable above background. Within the promoter sequence will be found a transcription initiation site (conveniently found, for example, by mapping with nuclease S1), as well as protein binding domains (consensus sequences) responsible for the binding of RNA polymerase.

[0055] A coding sequence is “under the control of” or is “operatively associated with” transcriptional and translational control sequences in a cell when RNA polymerase transcribes the coding sequence into RNA, which is then trans-RNA spliced (if it contains introns) and, if the sequence encodes a protein, is translated into that protein.

[0056] The term “express” and “expression” means allowing or causing the information in a gene or DNA sequence to become manifest, for example producing RNA (such as rRNA or mRNA) or a protein by activating the cellular functions involved in transcription and translation of a corresponding gene or DNA sequence. A DNA sequence is expressed by a cell to form an “expression product” such as an RNA (e.g., a mRNA or a rRNA) or a protein. The expression product itself, e.g., the resulting RNA or protein, may also said to be “expressed” by the cell.

[0057] The term “heterologous” refers to a combination of elements not naturally occurring. For example, the present invention includes chimeric RNA molecules that comprise an rRNA sequence and a heterologous RNA sequence which is not part of the rRNA sequence. In this context, the heterologous RNA sequence refers to an RNA sequence that is not naturally located within the ribosomal RNA sequence. Alternatively, the heterologous RNA sequence may be naturally located within the ribosomal RNA sequence, but is found at a location in the rRNA sequence where it does not naturally occur. As another example, heterologous DNA refers to DNA that is not naturally located in the cell, or in a chromosomal site of the cell. Preferably, heterologous DNA includes a gene foreign to the cell. A heterologous expression regulatory element is a regulatory element operatively associated with a different gene that the one it is operatively associated with in nature.

[0058] The terms “mutant” and “mutation” mean any detectable change in genetic material, e.g., DNA, or any process, mechanism or result of such a change. This includes gene mutations, in which the structure (e.g., DNA sequence) of a gene is altered, any gene or DNA arising from any mutation process, and any expression product (e.g., RNA, protein or enzyme) expressed by a modified gene or DNA sequence. The term “variant” may also be used to indicate a modified or altered gene, DNA sequence, RNA, enzyme, cell, etc.; i.e., any kind of mutant. For example, the present invention relates to altered or “chimeric” RNA molecules that comprise an rRNA sequence that is altered by inserting a heterologous RNA sequence that is not naturally part of that sequence or is not naturally located at the position of that rRNA sequence. Such chimeric RNA sequences, as well as DNA and genes that encode them, are also referred to herein as “mutant” sequences.

[0059] “Sequence-conservative variants” of a polynucleotide sequence are those in which a change of one or more nucleotides in a given codon position results in no alteration in the amino acid encoded at that position.

[0060] “Function-conservative variants” of a polypeptide or polynucleotide are those in which a given amino acid residue in the polypeptide, or the amino acid residue encoded by a codon of the polynucleotide, has been changed or altered without altering the overall conformation and function of the polypeptide. For example, function-conservative variants may include, but are not limited to, replacement of an amino acid with one having similar properties (for example, polarity, hydrogen bonding potential, acidic, basic, hydrophobic, aromatic and the like). Amino acid residues with similar properties are well known in the art. For example, the amino acid residues arginine, histidine and lysine are hydrophilic, basic amino acid residues and may therefore be interchangeable. Similar, the amino acid residue isoleucine, which is a hydrophobic amino acid residue, may be replaced with leucine, methionine or valine. Such changes are expected to have little or no effect on the apparent molecular weight or isoelectric point of the polypeptide. Amino acid residues other than those indicated as conserved may also differ in a protein or enzyme so that the percent protein or amino acid sequence similarity (e.g., percent identity or homology) between any two proteins of similar function may vary and may be, for example, from 70% to 99% as determined according to an alignment scheme such as the Cluster Method, wherein similarity is based on the MEGALIGN algorithm. “Function-conservative variants” of a given polypeptide also include polypeptides that have at least 60% amino acid sequence identity to the given polypeptide as determined, e.g., by the BLAST or FASTA algorithms. Preferably, function-conservative variants of a given polypeptide have at least 75%, more preferably at least 85% and still more preferably at least 90% amino acid sequence identity to the given polypeptide and, preferably, also have the same or substantially similar properties (e.g., of molecular weight and/or isoelectric point) or functions (e.g., biological functions or activities) as the native or parent polypeptide to which it is compared.

[0061] The term “homologous”, in all its grammatical forms and spelling variations, refers to the relationship between two proteins that possess a “common evolutionary origin”, including proteins from superfamilies (e.g., the immunoglobulin superfamily) in the same species of organism, as well as homologous proteins from different species of organism for example, myosin light chain polypeptide, etc.; see, Reeck et al., Cell 1987, 50:667). Such proteins (and their encoding nucleic acids) have sequence homology, as reflected by their sequence similarity, whether in terms of percent identity or by the presence of specific residues or motifs and conserved positions.

[0062] The term “sequence similarity”, in all its grammatical forms, refers to the degree of identity or correspondence between nucleic acid or amino acid sequences that may or may not share a common evolutionary origin (see, Reeck et al., supra). However, in common usage and in the instant application, the term “homologous”, when modified with an adverb such as “highly”, may refer to sequence similarity and may or may not relate to a common evolutionary origin.

[0063] In specific embodiments, two nucleic acid sequences are “substantially homologous” or “substantially similar” when at least about 80%, and more preferably at least about 90% or at least about 95% of the nucleotides match over a defined length of the nucleic acid sequences, as determined by a sequence comparison algorithm known such as BLAST, FASTA, DNA Strider, CLUSTAL, etc. An example of such a sequence is an allelic or species variant of the specific genes of the present invention. Sequences that are substantially homologous may also be identified by hybridization, e.g., in a Southern hybridization experiment under, e.g., stringent conditions as defined for that particular system.

[0064] Similarly, in particular embodiments of the invention, two amino acid sequences are “substantially homologous” or “substantially similar” when greater than 80% of the amino acid residues are identical, or when greater than about 90% of the amino acid residues are similar (i.e., are functionally identical). Preferably the similar or homologous polypeptide sequences are identified by alignment using, for example, the GCG (Genetics Computer Group, Program Manual for the GCG Package, Version 7, Madison Wis.) pileup program, or using any of the programs and algorithms described above (e.g., BLAST, FASTA, CLUSTAL, etc.).

[0065] The terms “array” and “microarray” are used interchangeably and refer generally to any ordered arrangement (e.g., on a surface or substrate) or different molecules, referred to herein as “probes”. Each different probe of an arrays specifically recognizes and/or binds to a particular molecule, which is referred to herein as its “target”. Microarrays are therefore useful for simultaneously detecting the presence or absence of a plurality of different target molecules, e.g., in a sample. In preferred embodiments, arrays used in the present invention are “addressable arrays” where each different probe is associated with a particular “address”. For example, in preferred embodiments where the probes are immobilized on a surface or a substrate, each different probe of the addressable array may be immobilized at a particular, known location on the surface or substrate. The presence or absence of that probe's target molecule in a sample may therefore be readily determined by simply determining whether a target has bound to that particular location on the surface or substrate.

[0066] In various embodiments, an array of the invention may comprise a plurality of different antibodies that each bind to a particular target protein or antigen. More preferably, however, the methods of the invention are practiced using nucleic acid arrays (also referred to herein as “transcript arrays” or “hybridization arrays”) that comprise a plurality of nucleic acid probes immobilized on a surface or substrate. The different nucleic acid probes are complementary to, and therefore hybridize, to different target nucleic acid molecules, e.g., in a sample. Thus such probes may be used to simultaneously detect the presence and/or abundance of a plurality of different nucleic acid molecules in a sample, including the expression of a plurality of different genes; e.g., the presence and/or abundance of different mRNA molecules, or of nucleic acid molecules derived therefrom (for example, cDNA or cRNA).

[0067] A nucleic acid molecule is “hybridizable” to another nucleic acid molecule, such as a cDNA, genomic DNA, or RNA, when a single stranded form of the nucleic acid molecule can anneal to the other nucleic acid molecule under the appropriate conditions of temperature and solution ionic strength (see Sambrook et al., supra). The conditions of temperature and ionic strength determine the “stringency” of the hybridization. For preliminary screening for homologous nucleic acids, low stringency hybridization conditions (e.g., 5×SSC, 0.1% SDS, and no formamide; or 30% formamide, 5×SSC, 0.5% SDS) may be used. Alternatively, hybridizations may also be performed under conditions that are relatively more stringent, such as moderately stringent hybridization conditions (e.g., 40% formamide, with 5× or 6×SCC) or high stringency hybridization conditions (e.g., 50% formamide, 5× or 6×SCC). SCC is a buffer solution commonly used for nucleic acid hybridizations and comprises 0.15 M NaCl, 0.015 M Na-citrate.

[0068] Hybridization requires that the two nucleic acids contain complementary sequences, although depending on the stringency of the hybridization, mismatches between bases are possible. The appropriate stringency for hybridizing nucleic acids depends on the length of the nucleic acids and the degree of complementation, variables well known in the art. The greater the degree of similarity or homology between two nucleotide sequences, the greater the value of T_(m) for hybrids of nucleic acids having those sequences. The relative stability (corresponding to higher T_(m)) of nucleic acid hybridizations decreases in the following order:

[0069] RNA:RNA, DNA:RNA, DNA:DNA. For hybrids of greater than 100 nucleotides in length, equations for calculating T_(m) have been derived (see Sambrook et al., supra, 9.50-9.51). For hybridization with shorter nucleic acids, i.e., oligonucleotides, the position of mismatches becomes more important, and the length of the oligonucleotide determines its specificity (see Sambrook et al., supra, 11.7-11.8). A minimum length for a hybridizable nucleic acid is at least about 10 nucleotides; preferably at least about 15 nucleotides; and more preferably the length is at least about 20 nucleotides.

[0070] Suitable hybridization conditions for oligonucleotides (e.g., for oligonucleotide probes or primers) are typically somewhat different than for full-length nucleic acids (e.g., full-length cDNA), because of the oligonucleotides' lower melting temperature. Because the melting temperature of oligonucleotides will depend on the length of the oligonucleotide sequences involved, suitable hybridization temperatures will vary depending upon the oligonucleotide molecules used. Exemplary temperatures may be 37° C. (for 14-base oligonucleotides), 48° C. (for 17-base oligonucleotides), 55° C. (for 20-base oligonucleotides) and 60° C. (for 23-base oligonucleotides). Exemplary suitable hybridization conditions for oligonucleotides include washing in 6×SSC/0.05% sodium pyrophosphate, or other conditions that afford equivalent levels of hybridization.

[0071] Preferably, nucleic acid molecules in the present invention are detected by hybridization to probes of a microarray. Hybridization and wash conditions are therefore preferably chosen so that the probe “specifically binds” or “specifically hybridizes” to a specific target nucleic acid. In other words, the nucleic acid probe preferably hybridizes, duplexes or binds to a target nucleic acid molecules having a complementary nucleotide sequence, but does not hybridize to a nucleic acid molecules having a non-complementary sequence. As used herein, one polynucleotide sequence is considered complementary to another when, if the shorter of the polynucleotides is less than or equal to about 25 bases, there are no mismatches using standard base-pairing rules. If the shorter of the two polynucleotides is longer than about 25 bases, there is preferably no more than a 5% mismatch. Preferably, the two polynucleotides are perfectly complementary (i.e., no mismatches). In can be easily demonstrated that particular hybridization conditions are suitable for specific hybridization by carrying out the assay using negative controls. See, for example, Shalon et al., Genome Research 1996, 639-645; and Chee et al., Science 1996, 274:610-614.

[0072] Optimal hybridization conditions for use with microarrays will depend on the length (e.g., oligonucleotide versus polynucleotide greater than about 200 bases) and type (e.g., RNA, DNA, PNA, etc.) of probe and target nucleic acid. General parameters for specific (i.e., stringent) hybridization conditions are described above. For cDNA microarrays, such as those described by Schena et al. (Proc. Natl. Acad. Sci. USA 1996, 93:10614), typical hybridization conditions comprise hybridizing in 5×SSC and 0.2% SDS at 65° C. for about four hours, followed by washes at 25° C. in a low stringency wash buffer (for example, 1×SSC and 0.2% SDS), and about 10 minutes washing at 25° C. in a high stringency wash buffer (for example, 0.1×SSC and 0.2% SDS). Useful hybridization conditions are also provided, e.g., in Tijessen, Hybridization with Nucleic Acid Probes, Elsevier Sciences Publishers (1996), and Kricka, Nonisotopic DNA Probe Techniques, Academic Press, San Diego Calif. (1992).

[0073] The term “expression profile” or “gene signature” refer, generally, to any description or measurement of the genes and/or nucleic acids that are expressed by a cell or organism under particular conditions. For example, an expression profile may be measured under particular conditions of growth, for example at a particular temperature, in the presence or absence of particular growth media, and/or in the presence or absence of particular nutrients. In preferred embodiments, gene signatures may be obtained, e.g., for cells or tissues that are derived from an individual or individuals having a neuropsychiatric disorder. Gene signatures may also be obtained for a cell or organism exposed to one or more particular drugs or other compounds, such as for a cell or organism exposed to a known therapeutic compound (e.g., with a known use for treating a neuropsychiatric disorder) or for a cell or organism exposed to a “test” or “candidate” compound (e.g., as part of a MPHTS assay). An expression profile or gene signature may comprise a description of particular genes that are expressed by a cell or organism, a description of the level or abundance with which genes are expressed in a cell or organism, or both. Accordingly, the term “signature gene” is used herein to refer to a gene that may be used, either alone or with other genes (e.g., as part of a gene signature) to characterize a particular condition such as the presence or absence of a neuropsychiatric disorder.

[0074] Preferably, an expression profile will comprise a list of different mRNA species that are expressed by a cell and their relative abundances. For example, mRNA abundances can be measured using a microarray, as described below. In more preferable embodiments, nucleic acids (e.g., mRNA) expressed by a cell are reversed transcribed into either cDNA or cRNA, and the abundances of the cDNA and/or cRNA molecules are measured.

[0075] 6.2. Multi-Parameter High Throughput Screening (MPHTS)

[0076] MPHTS methods are described in U.S. patent application Publication No. 2003/0096264 by Altar et al. Such methods generally comprise the following five elements. The skilled artisan will appreciate, however, that the invention may be practiced omitting one or more of these elements and without executing the recited elements in any particular order. For example, in certain embodiments, some of the below-described elements may be obtained from another source, such as from an online database. The invention may therefore be practiced without necessarily performing each of these elements, e.g., as a separate step in a screening method.

[0077] First, gene-signatures are obtained or provided by measuring expression levels for a plurality of genes in cells or tissues derived from an individual having a neuropsychiatric disorder or from a suitable model. For instance, in the Examples, infra, electroconvulsive seizures (ECS) induced in rats are used as models for electroconvulsive therapy (ECT) in human patients. Gene signatures are therefore obtained by measuring expression levels for a plurality of genes in the cells or tissues (more specifically in frontal cortex and hippocampus tissues) of animals subjected to ECS. In alternative embodiment, the cells and/or tissues are brain cells or tissues can be derived from human psychiatric patients (for example, in post mortem tissue samples). However, brain and other neuronal cells or tissues from other species of organisms may also be used, such as from a mouse, a rat, a primate or another species of mammal. Preferably, the organism from which the brain cells or tissue are derived represents an acceptable animal model for a neuropsychiatric disorder. Preferably, the expression levels measured in the cells or tissues are compared to expression levels from normal cells or tissues (i.e., brain cells or tissues from healthy individuals, not affected by a neuropsychiatric disorder) to identify particular genes that are differentially expressed in cells from an individual having a neuropsychiatric disorder compared to one who does not have a neuropsychiatric disorder.

[0078] Second, gene-signatures may also be obtained or provided by measuring expression levels for a plurality of genes in cultured neuronal cells or tissues (e.g., in cultured neurons that are derived from neural stem cells or from other neuronal cell lines). Human neurons and/or neuronal cell lines are particularly preferred. However, the cells may be obtained or derived from any species of organism, particularly a mammalian species such as a mouse, a rat or a primate. Similarly, the cultured neuronal tissues may also be obtained from any species of mammal, such as from a rat, a mouse, a primate or a human.

[0079] For example, and not by way of limitation, a mouse neuroblastoma cell line may be used in such methods. Such cells are readily available, e.g., from the American Type Culture Collection (“ATCC”, Manasas Virginia). See, for example, ATCC Accession No. CRL-2263. As another non-limiting example, U.S. provisional patent application serial No. 60/299,066 filed on Jun. 18, 2001 describes the use of rat neuronal cell cultures to evaluate neuropsychiatric drugs. Such cells may also be used in the MPHTS methods of this invention.

[0080] Third, drug signatures may also be obtained or provided by measuring expression levels for a plurality of genes in cultured neuronal cells or tissues that are treated with a therapeutic compound. The cultured cells may be any type of neuronal cell or cell lines described supra for obtaining gene-signatures from a cell line. Similarly, any of the types of tissue cultures described, supra, may also be used to obtain drug signatures. Preferably, the drug signatures are signatures for compounds that are known to be effective for treating a neuropsychiatric disorder. Exemplary compounds may include valproate, buspirone, lithium, carbamazepine, clozapine, olanzapine, haloperidol, secretin and vasoactive intestinal polypeptide (VIP), to name a few. Exemplary drug signatures, which were obtained from broth rat and human neuronal cells treated with therapeutic compounds, are provided in the Examples, infra. Other drug signatures may be readily obtained by those skilled in the art.

[0081] Fourth, expression levels for the plurality of genes are obtained or provided in neuronal cells that are contacted with a test compound (referred to here as a “drug candidate”), and these expression levels may then be compared to expression levels from gene signatures obtained for the neuropsychiatric disorder (as described in the first element, supra) and/or to drug-signatures obtained the known therapeutic compound (as described in the third element, supra). In preferred embodiments, expression levels or “signatures” obtained from a test compound are also compared to expression levels when the cell or cell line is not contacted with the test compound or any other drug (described in the second element, supra).

[0082] Generally speaking, the “signature” or expression levels obtained when the neuronal cells are contacted with a test compound are compared to the gene signatures of the cells when they are not contacted with any test or therapeutic compound (i.e., the gene signature obtained as element two, described supra) to identify changes in the expression level(s) for particular genes. Similarly, the drug-signature (obtained as described, supra, for element three) is also compared to the neuronal cell lines gene signature, to identify particular genes whose expression levels change when the cells are contacted with the therapeutic compound. In instances where changes in expression levels when the cells are contacted with the test compound are identical (or at least similar) to changes in expression levels when the cell are contacted with the known therapeutic compound, then the test compound is identified as a candidate compound for treating the neuropsychiatric disorder. Thus, using these screening methods a skilled artisan is able to rapidly and inexpensively identify compounds that are most promising as novel neuropsychiatric drugs, while eliminating compounds that show little promise and/or are unlikely candidates for treating a neuropsychiatric disorder.

[0083] In preferred embodiments of the invention, changes in expression levels when the cells are contacted with the test compound may also be compared to gene signatures obtained for the particular neuropsychiatric disorder of interest (i.e., to the gene signatures obtained as described, supra, for the first element). Preferably, a test compound that is identified as a candidate therapeutic compound will alter the expression of “signature gene” in a way that is opposite or contrary to the expression observed in the disorder's gene signature. For example, where a particular gene is expressed at abnormally high levels in cells or tissues from individuals affected by the particular neuropsychiatric disorder (compared to expression levels in cells or tissues from individuals not affected by the disorder), a candidate compound identified in these screening methods will preferably inhibit that gene's expression (i.e., the gene is preferably expressed at lower levels when the cells are contacted with the test compound, compared to its expression when the cell is not contacted with the test compound).

[0084] Fifth, as an optional element of the invention, drug candidate or candidate compounds that are identified as described, supra, may be further optimized, e.g., to account for individual genetic variability.

[0085] As indicated above, the MPHTS assays of the invention are useful as an inexpensive and rapid initial screening to quickly identify compounds that are most promising as neuropsychiatric drugs, while quickly eliminating compounds that show little promise and/or are unlikely candidates for treating a neuropsychiatric disorder. In preferred embodiments, the MPHTS assays are used to identify candidate compounds for treating bipolar affective disorder (BAD), depression, schizophrenia and autism. However, the assays are by no means limited to these particular disorders, and may be readily adapted to identify candidate compounds for treating any neuropsychiatric disorder. Other exemplary, preferred neuropsychiatric disorders for which these assays may be used include anxiety disorders, eating disorders, addictive disorders and Attention Deficit Hyperactivity Disorder (ADHD).

[0086] Classes of compounds that may be identified by such screening assays include, but are not limited to, small molecules (e.g., organic or inorganic molecules which are less than about 2 kd in molecular weight, are more preferably less than about 1 kd in molecular weight, and/or are able to cross the blood-brain barrier or gain entry into an appropriate cell, as well as macromolecules (e.g., molecules greater than about 2 kd in molecular weight). In preferred embodiments, commercially available compound libraries may be purchased and screened in an MPHTS assay of the invention. Examples of preferred libraries include TOCRIS (Tocris Cookson, Ltd. Avonmouth Bristol, United Kingdom), SIGMA RBI (Sigma Aludrich Inc., St. Louis Mo.), ChemBridge (ChemBridge Corp., San Diego Calif.), Chemdiv (ChemDiv Inc., San Diego Calif.) and Prestwick (Prestwick Chemical, Inc., Washington D.C.), to name a few.

[0087] The selection of appropriate small molecule compound concentrations for the treatment of cells in vitro or for dosing of animals in vivo is preferred to discriminate between physiological and toxicological effects of a given compound. As an initial means for determining the deleterious effects of a compound or set of compounds, cells may be seeded (e.g., in multiple-well plates) and treated with a range of compound concentrations.

[0088] The compounds' effect (e.g., its cytotoxic or apoptotoic effect) may then be gauged, e.g., using commercially available kits and routine methods well known in the art.

[0089] Compounds identified by these screening assays may also include peptides and polypeptides. For example, soluble peptides, fusion peptides members of combinatorial libraries (such as ones described by Lam et al., Nature 1991, 354:82-84; and by Houghten et al., Nature 1991, 354:84-86); members of libraries derived by combinatorial chemistry, such as molecular libraries of D- and/or L-configuration amino acids; phosphopeptides, such as members of random or partially degenerate, directed phosphopeptide libraries (see, e.g., Songyang et al., Cell 1993, 72:767-778); antibodies, including but not limited to polyclonal, monoclonal, humanized, anti-idiotypic, chimeric, or single chain antibodies; antibody fragments, including but not limited to FAb, F(ab′)₂, FAb expression library fragments and epitope-binding fragments thereof.

[0090] The compounds used in such screening assays are also preferably essential pure and free of contaminants which may, themselves, alter or influence gene expression. Compound purity may be assessed by any number of means that are routine in the art, such as LC-MS and NMR spectroscopy. Libraries of test compounds are also preferably biased by using computational selection methods which are routine in the art. Tools for such computational selection, such as Pipeline PilotJ (Scitegic Inc., San Diego, Calif.) are commercially available. The compounds may be assessed using rules such as the “Lipinski criteria” (see, Lipinski et al., Adv. Drug Deliv. Rev. 2001, 46:3-26) and/or any other criteria or metrics commonly used in the art.

[0091] 6.3. Preparation of Neuronal Cell and Tissue Samples

[0092] Brain tissue samples. In certain limited embodiments, brain cells and tissues for use in the MPHTS methods of this invention may be obtained from individuals (e.g., from patients) in a biopsy. However, those skilled in the art will recognize that brain surgeries permitting a biopsy are relatively rare and primarily involve surgical excisions (e.g., for the treatment of epilepsy) rather than brain regions relevant to neuropsychiatric disorder such as schizophrenia or bipolar affective disorder. In certain embodiments, however, useful disease profiles may be obtained from cultured peripheral nervous system neurons, such as rhinoneuroepithelial cells. Such cells may be readily obtained from a nasal biopsy, and disease profiles from such cells may be used to identify changes in gene expression that are associated with neuropsychiatric disorders such as schizophrenia.

[0093] In preferred embodiments, brain cells or tissues used in the methods of this invention are instead obtained post-mortem, e.g., from cadavers of individuals who had or exhibited symptoms of a neuropsychiatric disorder during their lifetime.

[0094] Those skilled in the art will readily appreciate that a large number of carefully collected brain tissue samples should preferably be obtained to assure statistical reliability (see, for example, Torrey et al., SchizophrRes. 2000,44:151; Bahn et al., J. Chem. Neuroanatomy 2001, 22:79-94; and Vawter et al., Brain Res. Bull. 2001, 55:641-650). This is particularly desirable where there is considerable heterogeneity in patient age to permit accounting for age-associated variables (for example, progressive brain degeneration, which may also occur in schizophrenia). However, smaller samples may be used, e.g., for preliminary screening assays where statistical reliability may not be as essential. It is also preferable that the samples be matched, e.g., according to the patients' age, sex, cause of death and post-mortem interval. The brain samples used preferably are not acquired from cadavers under circumstances that might themselves affect the quality of the cells or tissues acquired. For example, samples obtained following a prolonged moribund state, a coma, hpoxia, pyrexia or stroke preferably are not used in MPHTS methods of the invention. A skilled artisan may readily recognize such compromised, ante mortem states, e.g., from the extent of brain acidosis. Generally, measured postmortem tissue pH values that are below about 6.4 indicate that the tissue has been subjected to such a compromised ante mortem state and should not be used. In addition, the postmortem tissue pH value is also critical to the integrity of mRNA obtained from the tissue.

[0095] It is understood that a reliable psychiatric diagnosis and cause of death should also be obtained or determined for the individual. It is, moreover, additionally preferably to identify factors such as concomitant medical conditions, medications taken during the patient's lifetime (particularly immediately prior to death), surgical treatments (including cancer treatments) and substance abuse for each patient. The hemisphere and region of the brain from which each sample is taken is also preferably noted and recorded.

[0096] Generally, samples that have been subject to such conditions as may affect the reliability of gene expression measurements should not be used. However, in many situations the skilled artisan will recognize that such factors may be sufficiently controlled for and the sample, therefore, acceptable for use in MPHTS. In such cases, however, it is preferable and often essential that the samples be appropriately matched. As an example, and not by way of limitation, it is recognized that smoking alters the expression of many genes in the hippocampus, a region of the brain that is also associated with schizophrenia (Wang et al., Abs. Soc. Neurosci. 2001, 27). However, the overlap between genes whose expression levels have been reported as altered by those two conditions is believed to be minimal (see, Wang et al., supra). Therefore, it may be possible to practice MPHTS methods of the invention using samples from smoking or non-smoking individuals, provided the samples are appropriately matched.

[0097] Those skilled in the art will also appreciate that the levels and quality of RNA extracted from post-mortem samples may be influenced by factors such as the post mortem interval (i.e., the time interval between death and RNA extraction), the refrigeration time (i.e., the time interval from death to patient storage in a cold environment), the storage time (i.e., the duration of time during which the cadaver is refrigerated). Accordingly, it is preferably that such factors be appropriately controlled and that the steps of RNA extraction from these tissue samples be as efficient as possible. In particularly preferred embodiments, the brain or tissue samples are unfixed (i.e., are not treated with protein cross-linkers such as formalin) and have not been thawed more than once.

[0098] In a preferred embodiment, samples of brain tissue may be obtained, e.g., post-mortem from cadavers of individuals who (during their lifetime) suffered from or exhibited symptoms of a neuropsychiatric disorder. However, single neurons or groups of homogeneous neurons may also be extracted from such cadavers, e.g., by laser capture microdissection (LCM). Using RNA amplification, gene expression profiles may be measured for these single cells as well (see, e.g., Eberwine et al., Proc. Natl. Acad. Sci. 1992, 89:30130-30134; and Luo et al., Nature Med. 1999, 5:117-119). Expression profiles obtained from these cells will therefore be particular for the particular cell types extracted, and may ultimately provide gene expression profiles that are more clearly ascribed to the particular cell population. Such gene profiles will typically be more robust, and therefore preferable, for evaluating a drug response.

[0099] Brain cells or tissues obtained from animals may also be used. For example, tissue or samples from animal models for a neuropsychiatric disorder may be used to model disease profiles for that disorder. Alternatively, expression profiles may be obtained from brain cells or tissues obtained from animals treated with a known anti-psychotic drug or with a test compound. In addition, cells from a transgenic animals may be employed, in which one or more genes relevant to a neuropsychiatric disorder have been altered, over-expressed or “knocked-out”. High throughput in vitro screening of candidate compounds may then be carried out using neuronal cells obtained or derived from such a transgenic animal.

[0100] Neuronal cells. In preferred embodiments, the MPHTS methods of the invention also used cultured cells or cell lines to screen for candidate therapeutic compounds. Preferably, the cells are ones having an expression profile that is typical of neuronal cells or, alternatively, they may be cells which can be manipulated to produce an expression profile typical of neuronal cells. The cells or cell lines used will also, preferably, give rise to reproducible changes in their gene expression profiles when contacted with known antipsychiatric drugs (for example, valproate). In a particularly preferred embodiment, these changes will be opposite changes that are observed in the disease signature. That is to say, in such embodiments, genes (or their homologs) normally expressed at higher levels in the disease signature are preferably expressed at lower levels in cells or cell lines contacted with the known antipsychiatric drug, and vice-versa.

[0101] In a preferred embodiment, pluripotent neuronal stem cell lines are used in these aspects of the invention. Such cell lines are well known in the art, and methods to induce or enhance the differentiation of such stem cell lines have been described. For example, U.S. Provisional Patent Application Publication Nos. 2003/0082802 A1 and 2003/0013192 A1 both describe methods for inducing differentiation in neuronal stem cells by exposure to chemicals (for example, valproate and buspirone). In other embodiments, such cells may be differentiated, e.g., using antisense strategies and/or routine techniques of molecular biology to develop stable, transfected cell lines. Alternatively, however, cells or cell lines may also be obtained from patients having a neuropsychiatric disorder of interest.

[0102] A skilled artisan will readily appreciate that cells or cell cultures used in the methods of this invention should be carefully controlled for parameters such as the cell passage number, cell density (e.g., in microplate wells), the method(s) by which cells are dispensed, and growth time after dispensing. It is also preferable to repeat mRNA and/or protein expression levels measured for a cell or cell line under particular conditions, to confirm that the measured levels are reproducible.

[0103] 6.4. Measuring Gene Expression

[0104] MPHTS methods generally, including the particular methods of this invention, may be implemented using any method suitable for measuring changes in the gene expression of a cell or cells. Such methods are well known and routinely used in the art. In preferred embodiments, methods are used that permit the simultaneous measurement of expression for a plurality of genes (e.g., for at least 10, more preferably for at least 100, still more preferably for at least 150). For example, in particularly preferred embodiments expression profiles are measured using “transcript arrays” or “microarrays,” as described below. However, any technique that is capable of measuring gene expression may be used and the methods of this invention are not limited to the use of nucleic acid microarrays. For instance, gene expression can also be measured in alternative embodiments by using a reverse transcription polymerase chain reaction (RT-PCR).

[0105] Systems and kits for implementing such assays are commercially available from a number of suppliers, including Affymetrix (Santa Clara, Calif.), Agilent (Palo Alto, Calif.), Promega (Madison, Wis.), Xantnon (Research Triangle Park, North Carolina), Illumina (San Diego, Calif.), Chromagen (San Diego, Calif.), Third Wave Technologies (Madison, Wis.), Aclara (Mountain View, Calif.), Beckton Dickinson & Co. (Franklin Lakes, N.J.) and Luminex (Austin, Tex.) to name a few.

[0106] Transcript arrays. In a preferred embodiment the present invention makes use of “transcript arrays” (also called herein “microarrays”). Transcript arrays can be employed for analyzing the steady state level of mRNAs in a cell, and especially for comparing the steady state levels between two cells, such as a first cell that has been exposed to a drug, drug candidate or other compound, and a second cell that has not been treated.

[0107] In one embodiment, transcript arrays are produced by hybridizing detectably labeled polynucleotides representing the mRNA transcripts present in a cell (e.g., fluorescently labeled cDNA synthesized from total cell mRNA) to a microarray. As explained in the definitions, supra, microarray is a surface with an ordered array of binding (e.g., hybridization) sites for products of many of the genes in the genome of a cell or organism, preferably most or almost all of the genes. Microarrays can be made in a number of ways, of which several are described below. However produced, microarrays share certain characteristics. The arrays are preferably reproducible, allowing multiple copies of a given array to be produced and easily compared with each other. Preferably the microarrays are small, usually smaller than 5 cm², and they are made from materials that are stable under binding (e.g., nucleic acid hybridization) conditions. A given binding site or unique set of binding sites in the microarray will specifically bind the product of a single gene in the cell. Although there may be more than one physical binding site (hereinafter “site”) per specific mRNA, for the sake of clarity the discussion below will assume that there is a single site. It will be appreciated that when cDNA complementary to the RNA of a cell is made and hybridized to a microarray under suitable hybridization conditions, the level of hybridization to the site in the array corresponding to any particular gene will reflect the prevalence in the cell of mRNA transcribed from that gene. For example, when detectably labeled (e.g., with a fluorophore) cDNA complementary to the total cellular mRNA is hybridized to a microarray, the site on the array corresponding to a gene (i.e., capable of specifically binding a nucleic acid product of the gene) that is not transcribed in the cell will have little or no signal, and a gene for which the encoded mRNA is prevalent will have a relatively strong signal.

[0108] In preferred embodiments, cDNAs from two different cells, e.g., a cell exposed to a test compound and a cell of the same type not exposed to the compound, are hybridized to the binding sites of the microarray. The cDNA derived from each of the two cell types are differently labeled so that they can be distinguished. In one embodiment, for example, cDNA from a cell treated with a drug is synthesized using a fluorescein-labeled dNTP, and cDNA from a second cell, not drug-exposed, is synthesized using a rhodamine-labeled dNTP. When the two cDNAs are mixed and hybridized to the microarray, the relative intensity of signal from each cDNA set is determined for each site on the array, and any relative difference in abundance of a particular mRNA detected.

[0109] In the example described above, the cDNA from the treated cell will fluoresce green when the fluorophore is stimulated and the cDNA from the untreated cell will fluoresce red. As a result, when the compound has no effect, either directly or indirectly, on the relative abundance of a particular mRNA in a cell, the mRNA will be equally prevalent in both cells and, upon reverse transcription, red-labeled and green-labeled cDNA will be equally prevalent. When hybridized to the microarray, the binding site(s) for that species of RNA will emit wavelengths characteristic of both fluorophores. In contrast, when the cell is exposed to a compound that, directly or indirectly, increases the prevalence of the mRNA in the cell, the ratio of green to red fluorescence will increase. When the drug decreases the mRNA prevalence, the ratio will decrease.

[0110] The use of a two-color fluorescence labeling and detection scheme to define alterations in gene expression has been described, e.g., in Shena et al., Science 1995, 270:467-470. An advantage of using cDNA labeled with two different fluorophores is that a direct and internally controlled comparison of the mRNA levels corresponding to each arrayed gene in two cell states can be made, and variations due to minor differences in experimental conditions (e.g., hybridization conditions) will not affect subsequent analyses. However, it will be recognized that it is also possible to use cDNA from a single cell, and compare, for example, the absolute amount of a particular mRNA in, e.g., a treated and untreated cell.

[0111] Preparation of microarrays. Nucleic acid microarrays are known in the art and preferably comprise a surface to which probes that correspond in sequence to gene products (e.g., cDNAs, mRNAs, cRNAs, polypeptides, and fragments thereof), can be specifically hybridized or bound at a known position. In one embodiment, the microarray is an array in which each position represents a discrete binding site for a product encoded by a gene (e.g., a protein or RNA), and in which binding sites are present for products of most or almost all of the genes in the organism's genome. In a preferred embodiment, the “binding site” (hereinafter, “site”) is a nucleic acid or nucleic acid analogue to which a particular cognate cDNA or CRNA can specifically hybridize. The nucleic acid or analogue of the binding site can be, e.g., a synthetic oligomer, a full-length cDNA, a less-than full-length cDNA, or a gene fragment.

[0112] Although in a preferred embodiment the microarray contains binding sites for products of all or almost all genes in the target organism's genome, such comprehensiveness is not necessarily required. Usually the microarray will have binding sites corresponding to at least about 50% of the genes in the genome, often at least about 75%, more often at least about 85%, even more often more than about 90%, and most often at least about 99%. Preferably, the microarray has binding sites for genes relevant to the action of a drug of interest. A “gene” is identified as a segment of DNA containing an open reading frame (ORF) of preferably at least 50, 75, or 99 amino acids from which a messenger RNA is transcribed in the organism (e.g., if a single cell) or in some cell in a multicellular organism. The number of genes in a genome can be estimated from the number of mRNAs expressed by the organism, or by extrapolation from a well-characterized portion of the genome. When the genome of the organism of interest has been sequenced, the number of ORFs can be determined and mRNA coding regions identified by analysis of the DNA sequence.

[0113] Preparing nucleic acids for microarrays. As noted above, the “binding site” to which a particular cognate cDNA specifically hybridizes is usually a nucleic acid or nucleic acid analogue attached at that binding site. In one embodiment, the binding sites of the microarray are DNA polynucleotides corresponding to at least a portion of each gene in an organism's genome. These DNAs can be obtained by, e.g., polymerase chain reaction (PCR) amplification of gene segments from genomic DNA, cDNA (e.g., by RT-PCR), or cloned sequences. PCR primers are chosen, based on the known sequence of the genes or cDNA, that result in amplification of unique fragments (i.e. fragments that do not share more than 10 bases of contiguous identical sequence with any other fragment on the microarray). Computer programs are useful in the design of primers with the required specificity and optimal amplification properties. See, e.g., Oligo version 5.0 (National Biosciences). In the case of binding sites corresponding to very long genes, it will sometimes be desirable to amplify segments near the 3′ end of the gene so that when oligo-dT primed cDNA probes are hybridized to the microarray, less-than-full length probes will bind efficiently. Typically each gene fragment on the microarray will be between about 50 bp and about 2000 bp, more typically between about 100 bp and about 1000 bp, and usually between about 300 bp and about 800 bp in length. PCR methods are well known and are described, for example, in Innis et al., eds., 1990, PCR Protocols: A Guide to Methods and Applications, Academic Press Inc. San Diego, Calif. It will be apparent that computer controlled robotic systems are useful for isolating and amplifying nucleic acids.

[0114] An alternative means for generating the nucleic acid for the microarray is by synthesis of synthetic polynucleotides or oligonucleotides, e.g., using N-phosphonate or phosphoramidite chemistries (Froehler et al., Nucleic Acid Res. 1986, 14:5399-5407; McBride et al., Tetrahedron Lett. 1983, 24:245-248). Synthetic sequences are between about 15 and about 500 bases in length, more typically between about 20 and about 50 bases. In some embodiments, synthetic nucleic acids include non-natural bases, e.g., inosine. As noted above, nucleic acid analogues may be used as binding sites for hybridization. An example of a suitable nucleic acid analogue is peptide nucleic acid (see, for example, Egholm et al., Nature 1993, 365:566-568. See, also, U.S. Pat. No. 5,539,083).

[0115] In an alternative embodiment, the binding (hybridization) sites are made from plasmid or phage clones of genes, cDNAs (e.g., expressed sequence tags), or inserts therefrom (Nguyen et al., Genomics 1995, 29:207-209). In yet another embodiment, the polynucleotide of the binding sites is RNA.

[0116] Attaching nucleic acids to the solid surface. The nucleic acids or analogues are attached to a solid support, which may be made from glass, plastic (e.g., polypropylene, nylon), polyacrylamide, nitrocellulose, or other materials. A preferred method for attaching the nucleic acids to a surface is by printing on glass plates, as is described generally by Schena et al., Science 1995, 270:467-470. This method is especially useful for preparing microarrays of cDNA. See also DeRisi et al., Nature Genetics 1996, 14:457-460; Shalon et al., Genome Res. 1996, 6:639-645; and Schena et al., Proc. Natl. Acad. Sci. USA 1995, 93:10539-11286.

[0117] A second preferred method for making microarrays is by making high-density oligonucleotide arrays. Techniques are known for producing arrays containing thousands of oligonucleotides complementary to defined sequences, at defined locations on a surface using photolithographic techniques for synthesis in situ (see, Fodor et al., Science 1991, 251:767-773; Pease et al., Proc. Natl. Acad. Sci. USA 1994, 91:5022-5026; Lockhart et al., Nature Biotech. 1996, 14:1675. See, also, U.S. Pat. Nos. 5,578,832; 5,556,752; and 5,510,270) or other methods for rapid synthesis and deposition of defined oligonucleotides (Blanchard et al., Biosensors&Bioelectronics 1996, 11:687-90). When these methods are used, oligonucleotides (e.g., 20-mers) of known sequence are synthesized directly on a surface such as a derivatized glass slide. Usually, the array produced is redundant, with several oligonucleotide molecules per RNA. Oligonucleotide probes can be chosen to detect alternatively spliced mRNAs.

[0118] Other methods for making microarrays, e.g., by masking (Maskos and Southern, Nuc. Acids Res. 1992, 20:1679-1684), may also be used. In principal, any type of array, for example, dot blots on a nylon hybridization membrane (see, Sambrook et al., Molecular Cloning—A Laboratory Manual (2nd Ed.), Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., 1989), could be used, although, as will be recognized by those of skill in the art, very small arrays will be preferred because hybridization volumes will be smaller.

[0119] Generating labeled probes. Methods for preparing total and poly(A)+ RNA are well known and are described generally in Sambrook et al., supra. In one embodiment, RNA is extracted from cells of the various types of interest in this invention using guanidinium thiocyanate lysis followed by CsCl centrifugation (Chirgwin et al., Biochemistry 1979, 18:5294-5299). Poly(A)⁺ RNA is selected by selection with oligo-dT cellulose (see Sambrook et al., supra). Cells of interest may include, but are not limited to, wild-type cells, drug-exposed wild-type cells, modified cells, and drug-exposed modified cells.

[0120] Labeled cDNA is prepared from mRNA by oligo dT-primed or random-primed reverse transcription, both of which are well known in the art (see, for example, Klug & Berger, Methods Enzymol. 1987, 152:316-325). Reverse transcription may be carried out in the presence of a dNTP conjugated to a detectable label, most preferably a fluorescently labeled dNTP. Alternatively, isolated mRNA can be converted to labeled antisense RNA synthesized by in vitro transcription of double-stranded cDNA in the presence of labeled NTPs (Lockhart et al., Nature Biotech. 1996, 14:1675). In alternative embodiments, the cDNA or RNA probe can be synthesized in the absence of detectable label and may be labeled subsequently, e.g., by incorporating biotinylated dNTPs or NTP, or some similar means (e.g., photo-cross-linking a psoralen derivative of biotin to RNAs), followed by addition of labeled streptavidin (e.g., phycoerythrin-conjugated streptavidin) or the equivalent.

[0121] When fluorescently-labeled probes are used, many suitable fluorophores are known, including fluorescein, lissamine, phycoerythrin, rhodamine (Perkin Elmer Cetus), Cy2, Cy3, Cy3.5, Cy5, Cy5.5, Cy7, Fluor X (Amersham) and others (see, e.g., Kricka, 1992, Nonisotopic DNA Probe Techniques, Academic Press San Diego, Calif.). It will be appreciated that pairs of fluorophores are chosen that have distinct emission spectra so that they can be easily distinguished.

[0122] In another embodiment, a label other than a fluorescent label is used. For example, a radioactive label, or a pair of radioactive labels with distinct emission spectra, can be used (see Zhao et al., Gene 1995, 156:207; Pietu et al., Genome Res. 1996, 6:492). However, because of scattering of radioactive particles, and the consequent requirement for widely spaced binding sites, use of radioisotopes is a less-preferred embodiment.

[0123] In one embodiment, labeled cDNA is synthesized by incubating a mixture containing 0.5 mM dGTP, dATP and dCTP plus 0.1 mM dTTP plus fluorescent deoxyribonucleotides (e.g., 0.1 mM Rhodamine 110 UTP (Perken Elmer Cetus) or 0.1 mM Cy3 dUTP (Amersham)) with reverse transcriptase (e.g., SuperScript.TM. II, LTI Inc.) at 42° C. for 60 min.

[0124] Hybridization to microarrays. Nucleic acid hybridization and wash conditions are chosen so that the probe “specifically binds” or “specifically hybridizes” to a specific array site, i.e., the probe hybridizes, duplexes or binds to a sequence array site with a complementary nucleic acid sequence but does not hybridize to a site with a non-complementary nucleic acid sequence. As used herein, one polynucleotide sequence is considered complementary to another when, if the shorter of the polynucleotides is less than or equal to 25 bases, there are no mismatches using standard base-pairing rules or, if the shorter of the polynucleotides is longer than 25 bases, there is no more than a 5% mismatch. Preferably, the polynucleotides are perfectly complementary (no mismatches). It can easily be demonstrated that specific hybridization conditions result in specific hybridization by carrying out a hybridization assay including negative controls (see, e.g., Shalon et al., supra; and Chee et al., supra).

[0125] Optimal hybridization conditions will depend on the length (e.g., oligomer versus polynucleotide greater than 200 bases) and type (e.g., RNA, DNA, PNA) of labeled probe and immobilized polynucleotide or oligonucleotide. General parameters for specific (i.e., stringent) hybridization conditions for nucleic acids are described in the definitions provided in Section 5.1, supra. When cDNA microarrays, such as those described by Schena et al. are used, typical hybridization conditions are hybridization in 5×SSC plus 0.2% SDS at 65° C. for 4 hours, followed by washes at 25° C. in low stringency wash buffer (e.g., 1×SSC plus 0.2% SDS) followed by 10 minutes at 25° C. in high stringency wash buffer (0.1×SSC plus 0.2% SDS). See, Shena et al., Proc. Natl. Acad. Sci. USA 1996, 93:10614). Useful hybridization conditions are also provided in, e.g., Tijessen, 1993, Hybridization With Nucleic Acid Probes, Elsevier Science Publishers B.V. See, also, Kricka, 1992, Nonisotopic DNA Probe Techniques, Academic Press, San Diego, Calif.

[0126] Signal detection and analysis. When fluorescently labeled probes are used, the fluorescence emissions at each site of a transcript array can be preferably detected by scanning confocal laser microscopy. In one embodiment, a separate scan, using the appropriate excitation line, is carried out for each of the two fluorophores used. Alternatively, a laser can be used that allows simultaneous specimen illumination at wavelengths specific to the two fluorophores and emissions from the two fluorophores can be analyzed simultaneously (see, Shalon et al., Genome Research 1996, 6:639-645). In a preferred embodiment, the arrays are scanned with a laser fluorescent scanner with a computer controlled X-Y stage and a microscope objective. Sequential excitation of the two fluorophores is achieved with a multi-line, mixed gas laser and the emitted light is split by wavelength and detected with two photomultiplier tubes. Fluorescence laser scanning devices are described in Schena et al., Genome Res. 1996, 6:639-645 and in other references cited herein. Alternatively, the fiber-optic bundle described by Ferguson et al., Nature Biotech. 1996, 14:1681-1684, may be used to monitor mRNA abundance levels at a large number of sites simultaneously.

[0127] Signals are recorded and, in a preferred embodiment, analyzed by computer, e.g., using a 12 bit analog to digital board. In one embodiment the scanned image is despeckled using a graphics program (e.g., Hijaak Graphics Suite) and then analyzed using an image gridding program that creates a spreadsheet of the average hybridization at each wavelength at each site. If necessary, an experimentally determined correction for “cross talk” (or overlap) between the channels for the two fluors may be made. For any particular hybridization site on the transcript array, a ratio of the emission of the two fluorophores can be calculated. The ratio is independent of the absolute expression level of the cognate gene, but is useful for genes whose expression is significantly modulated, e.g., by administering a drug, drug-candidate or other compound, or by any other tested event.

[0128] In one preferred embodiment of the invention, the relative abundance of an mRNA in two cells or cell lines tested (e.g., in a treated verses untreated cell) may be scored as perturbed (i.e., where the abundance is different in the two sources of mRNA tested) or as not perturbed (i.e., where the relative abundance in the two sources is the same or is unchanged). Preferably, the difference is scored as perturbed if the difference between the two sources of RNA of at least a factor of about 25% (i.e., RNA from one sources is about 25% more abundant than in the other source), more preferably about 50%. Still more preferably, the RNA may be scored as perturbed when the difference between the two sources of RNA is at least about a factor of two. Indeed, the difference in abundance between the two sources may be by a factor of three, of five, or more.

[0129] In other embodiments, it may be advantageous to also determine the magnitude of the perturbation. This may be done, as noted above, by calculating the ratio of the emission of the two fluorophores used for differential labeling, or by analogous methods that will be readily apparent to those of skill in the art.

[0130] 6.5. Bioinformatics and Statistics

[0131] Those skilled in the art will readily appreciate that the MPHTS assays of this invention will, at least in preferred embodiments, track a large amount of data from many sources including, e.g., expression levels for a large number of different genes in a variety of different cell and tissue types and under a variety of different conditions. The invention therefore preferably makes use of methods in bioinformatics and statistical analysis to integrate such data. Such analysis tools include, for example, clustering and class partitioning algorithms that enable a user to summarize and visualize effects of multiple variables on relationships within a data set. In a particularly preferred embodiment, the MPHTS methods of this invention make use of a statistical analysis tool referred to as “Principal Component Analysis” or “PCA”. The technique is well known in the art and may be implemented, e.g., using commercially available software such as the Partek suite of pattern recognition tools (Partek Inc., St. Charles, Minnesota).

[0132] By PCA analysis of gene expression data from different brain areas and disease states, a user is able to readily identify whether the major source or sources of variance within the data set are correlated with the particular cells or tissue and/or whether such variance is correlated with a neuropsychiatric disorder of interest. An exemplary figure depicting this analysis is set forth here, in FIG. 2. Those skilled in the art will readily appreciate and/or be able to select appropriate cutoffs (e.g., a maximum significant p-value) for use in such methods.

[0133] Statistically significant changes in gene expression may also be identified by coordinately regulated genes in distinct pathways, as well as coordinate changes of multiple genes within a common pathway (e.g., genes involved in a common metabolic pathway or process). These provide an aggregate level of statistical significance that far exceeds the statistical significance obtained for the genes individually.

[0134] In preferred embodiments, RNA extraction and/or hybridization experiments are repeated at least once, and more preferably multiple times for each sample to assure statistically robust and reproducible results. Changes in gene expression that appear to be statistically significant may also be confirmed by an independent experimental technique such as real-time polymerase chain reaction (RT-PCR), quantitative in situ hybridization, immunohistochemistry and functional assays of the translated protein(s), all of which are ell known and routinely used in the art.

[0135] 6.6. Uses of Gene Signatures in MPHTS

[0136] Once genes signatures or “efficacy genes” for a particular disorder have been identified and/or selected, they may be readily used, e.g., in screening assays to identify promising therapeutic compounds that may be useful for diagnosing and/or treating such a disorder. In particular, a candidate therapeutic compound may be identified in screening assays of the invention by identifying compounds that produce changes in the expression of signature genes that are similar to the changes observed in the ECS model, demonstrated in the exmples, infra. Such changes may be identified qualitatively, but are more preferably identified quantitatively; for example, by assigning a MPHTS “value” for each compound tested in the screening assay.

[0137] As an example, and not by way of limitation, such an MPHTS value may simply be the sum of changes in each signature gene's expression observed for a test compound in the screening assay. Preferably, these changes in the signature genes' expression levels are normalized as a percentage of the “optimal” change in each gene's expression. As used here, the change in expression of an signature gene is said to be “optimal” when it is approximately equal to the change in expression associated with a therapeutic benefit. Optionally, the change in each signature gene's expression can also be weighted, e.g., by the signature gene's score (as determined, e.g., using the algorithm set forth in Section 8.2, infra). The calculation of such a value can be easily represented mathematically by the equation: $\begin{matrix} {V = {\sum\limits_{i}{\omega_{i}E_{i}}}} & \left( {{Equation}\quad 1} \right) \end{matrix}$

[0138] In Equation 1, V is the MPHTS “score” calculated for a test compound in an MPHTS assay. E_(i) is the measured change in the expression of gene i in cells contacted with the test compound compared to the expression in cells that are not contacted with the test compound. As noted above, E_(i) will preferably be normalized to the “optimal” change associated with a desired therapeutic effect. For example, E_(i) can be expressed as the percentage or fraction of optimal change. ω_(i) indicates the score for the signature gene i. In preferred embodiments, ω_(i) obtained or derived from the score value calculated for gene i (e.g., according to the algorithm set forth in Section 8.2) and is converted to a percentage of the average score value or the signature genes that comprise the entire set used for drug screening.

7. EXAMPLES

[0139] The present invention is also described and demonstrated by way of the following examples. However, the use of these and other examples anywhere in the specification is illustrative only and in no way limits the scope and meaning of the invention or of any exemplified term. Likewise, the invention is not limited to any particular preferred embodiments described here. Indeed, many modifications and variations of the invention may be apparent to those skilled in the art upon reading this specification, and such variations can be made without departing the invention in spirit or in scope. The invention is therefore to be limited only by the terms of the appended claims along with the full scope of equivalents to which those claims are entitled.

[0140] 7.1. Gene Signatures of Electroshock Therapy as Modeled by Rodent Electroconvulsive Seizures

[0141] 7.1.1 Materials and Methods

[0142] Subjects. Male, Sprague-Dawley rats (ACE Animals; Boyertwon, PA) were 2-6 months of age and weighted 200-350 g at the start of each experiment. Animals were housed 2 per cage, understandard conditions, including ad libitum access to food and water in a 12:12 hour light/dark vivarium.

[0143] ECS Delivery. Animals were handled for 3 days prior to the beginning of the experiment for them to become adjusted to the seizure induction area and to application of the ear-clip electrodes. The seizure induction or sham-seizure induction areas consisted of plastic cages with fresh bedding material. Each cage was placed on a lab bench and a pair of electrodes was hung over the top of each cage. One set of electrodes was connected to the ECT Unit (Ugo Basile # 57800, Comerio, Italy) for use with the ECS animals and the other set (for control animals) was disconnected from the ECT unit such that no current could be delivered. Cage mates were run concurrently: one animal was randomly assigned to the shock group and the other to the control group. Both animals were removed from their home cage and placed in the appropriate testing cage. Each set of ear-clip electrodes were dampened with saline and attached to the deep region of the outer ear. The shock, a square wave pulse (pulse width=0.7 ms, 50 Hz) of 50 mA (Madsen et al., Biol. Psychiatry 2000, 47:1043-1049) was administered for a duration for 1.0 second to provide a tonic phase characterized by extension of all limbs and forward head extension, lasting for 10-15 seconds. Animals, both ECS and control, were returned to their home cage 10 minutes after seizure induction. Acute ECS animals (N=12) were given a single shock. Chronic ECS animals (N=12) were given one ECS per day for 10 consecutive days. Control group rats (N=12 acute and N=12 chronic) followed this protocol.

[0144] Visual observation of behaviors, side effects. Visual observations were sufficient to confirm the presence of clonic-tonic seizures immediately following the shock. Side effects during the post-ictal phase, which lasted about 10 minutes, were best characterized by heightened responsivity to auditory and tactile stimulation and decreased motor activity. General behavioral observations made in the home cage for up to 6 hours post-shock administration revealed sedation but no signs of recurrent seizures.

[0145] Brain area dissections. Animals were sacrificed by decapitation without anesthesia approximately 3-5 hours after the final ECS treatment. The brain was removed from the calvarium and immersed in crushed ice for 3-5 minutes. Coronal slabs of 2 mm thickness were prepared, from which the frontal cortex and hippocampus were dissected. The brain regions were weighed, frozen on dry ice, and stored at −80° C.

[0146] RNA Extraction. Total RNA was extracted from about 70-100 mg of frozen rat tissue with 0.7-1.0 ml of TRIzol reagent (Invitrogen; Carlsbad, Calif.) as per the manufacturer's instructions. The tissue was homogenized with a Polytron (Fisher) for 30 seconds. The homogenate was cleared by a 10 minute centrifugation at 10,000×g, and 0.2 ml of chloroform was added per ml of TRIzol. The samples were vigorously shaken for 20 seconds and incubated on ice for 10 minutes. The aqueous phase was separated by centrifugation at 10,000×g for 10 minutes, allowed to precipitate for 10 minutes, and the precipitate collected by centrifugation at 12,000×g for 10 minutes. The pellet was washed with 70% ethanol collected by a brief centrifugation, air dried, and resuspended in H₂O. The RNA was further purified using an RNeasy column (Qiagen; Valencia, Calif.). The purified RNA was quantified by a UV spectrophotometer, and RNA quality determined by capillary electrophoresis on an Agilent Bioanalyzer.

[0147] Gene expression profiling. RNA (10 μg) was converted to double-stranded cDNA following priming with an oligo-dT-T7 primer (Operon). The resultant cDNA was purified by phenol-chloroform extraction, ethanol-precipitated, and resuspended in H₂O. The purified cDNA was subjected to in vitro transcription using T7 polymerase in the presence of biotinylated UTP and CTP (Enzo Life Sciences, Farmingdale, NY). The resultant aRNA was purified with an RNeasy column (Qiagen, Valencia, Calif.), eluted in H₂O, and quantified using a UV spectrophotometer. aRNA (15 μg) was fragmented following the Affymetrix protocol. The quality of RNA fragmentation was checked on the Agilent Bioanalyzer, and 30 μl was added to 270 μl of hybridization buffer and hybridized to the Affymetrix U34A Rat Genome GeneChip®. After 16 hours of hybridization at 45° C. the GeneChip was washed, stained, and scanned according to the standard Affymetrix protocol.

[0148] Statistical analysis. Each gene on Affymetrix U34A gene chips is represented by one or more of about 8800 “probe sets”—i.e., sets of oligonucleotides that are derived from the mRNA sequence of that particular gene. Following hybridization and scanning, each probe set was assigned a “present” or “absent” call and an overall fluorescent signal intensity as determined by the Affymetrix Microarray Suite Software 5.0 following global normalization. A filtering of all probe sets was conducted to remove those genes that were clearly not expressed in the tissue, and such probe sets were not analyzed. Specifically, a probe set for a particular gene was retained for all samples only if it contained a “present” call for at least one sample, and only if the mean fluorescence intensity for either the control or treatment group was greater than 100. This procedure filtered out about 50% of all probe sets, consistent with the known expression of approximately 40-50% of rat genes within brain tissue. Absent or present calls are assigned by Affymetrix software on the basis of a comparison between the intensities of perfect oligonucleotide matches and single oligonucleotide mismatches. The finding that mismatch intensities are not significantly lower than those for the perfect match oligonucleotides does not necessarily imply that the gene is not expressed, as it may be a result from cross-hybridization of other mRNA species, other technical artifacts, or measurement noise. Therefore, present calls were counted in a liberal manner to avoid discarding probe sets where there was some indication that the gene may in fact be expressed. The alteration of many genes in more than one tissue or after both durations of ECS, and the confirmation of many gene changes by RT-PCR, confirms that measurements of some genes with relatively few present calls would be detected in independent tissues and by an independent method.

[0149] A principal components analysis (PCA) was performed using Partek Pro 2000 software package (Partek, Inc.). The analysis treated arrays as “objects” and probe sets as “variables” to detect outlier arrays. PCA plots mapped the distribution of arrays within a 3-dimensional space defined by the three greatest contributors to the variance in gene expression values. The position of an array on such plot reflects the overall expression profile of the sample as measured by the array. The arrays that appeared as outliers on the plots were removed from further consideration. Such PCA outliers are usually associated with a technical problem, such as marginally acceptable RNA quality, regional blotches in hybridization signals, or weak overall hybridization on the corresponding gene chip. For each of the four experiments, (acute and chronic ECS, frontal cortex and hippocampus), at least seven, but typically nine or all 10, of the rat samples from each control or treatment group were retained for further analysis.

[0150] The ability of acute or chronic ECS to alter gene expression versus that of sham-treated controls was determined for each tissue type. The magnitude of expression level change was estimated as the ratio of the means between the treated and control groups. Probe sets were considered as “hits” only when the average normalized signal intensity exceeded 100 in either of the groups, the ratio of means exceeded 1.5 or were less than 0.67 and the p value of the two-tailed Welch t test was less than 0.05.

[0151] In a multiple testing setting, the p value of 0.05 does not necessarily mean that the probability of type 1 error is really 0.05, unless the Bonferroni's correction has been applied. Bonferroni's correction was not applied in these experiments, however, because it would result in many type 2 errors and the loss of most valid hits. The uncorrected p value of a statistical test is therefore useful here primarily as a measure of difference between data sets. According to a recent study (Xu & Li, Bioinformatics 2003, 19:1284-1289), a parametric p value may outperform a permutational one as a distance metric. Additionally, the 1.5 ratio of means cut-off used here increased the verification rate of gene expression changes.

[0152] GeneSpring® analysis software (Silicon Genetics) was used to evaluate the overlap between the gene changes identified in the four comparisons. Since about 3,500-4,400 genes were present in each sample, a significant number of false positives was expected. To help remove false positives, and also to increase confidence in selecting gene hits for RT-PCR confirmation, each statistically significant gene change was ranked objectively by a biostatistical ECS gene selection algorithm (set forth in the appendix at Section 8.2, infra) according to its statistical and biological significance, and by the robustness of experimental change, including whether it was altered in one or both tissues and after acute, chronic, or both ECS durations.

[0153] Real-time polymerase chain reaction assay (RT-PCR). Total RNA (2 μg) were subjected to DNAse treatment in a 10 μl reaction containing 1 μl 10×DNAse I reaction buffer, and 1 μl DNAse I (Invitrogen, Carlsbad, Calif.). The reaction was carried out at room temperature for 10 minutes. One μl of EDTA (25 mM) and 1 μl of oligo (dT)_(12-18 mer) (0.5 μg/ml, Invitrogen, Carlsbad, Calif.) were added to DNAse reaction and heated to 70° C. for 15 minutes in a water bath to simultaneously inactivate the DNAse I activity and eliminate RNA secondary structure to allow oligo dT-poly A annealing. The sample was placed on ice for 2 minutes and collected by brief centrifugation. The RNA in the sample was reverse-transcribed into cDNA by the addition of 8 μl of master mix containing 4 μl of 5×first strand buffer, 2 μl DTT (0.1 M), 1 μl dNTP's (10 mM each), and 1 μl SuperScript II (200U/μl) (Invitrogen, Carlsbad, Calif.), and incubated at 42° C. for 45 minutes. The RT reaction was diluted 10 fold with dH₂O and stored at 4° C.

[0154] For each specific gene tested, diluted cDNA (5 μl) was added to 45 μl PCR reaction mixture that contained 25 μl of 2×SYBR® Green PCR Master Mix (Applied Biosystems, Warrington, UK) and 25 picomole of each forward and reverse primer. Each sample was subjected to 40 cycles of Real Time PCR (ABI 7900, Applied Biosystems, Warrington UK) where fluorescence is measured several times during each cycle of 2-step PCR alternating between about 95° C. for 15 seconds and 60° C. for 1 minute. The threshold cycle (Ct), or point at which signal fluorescence exceeds background, for each sample for each gene was compared to a standard curve to determine a relative expression value. The standard curve was generated by real time PCR analysis of five, 10-fold serial dilutions of a cDNA generated from the RNA of rat cortical stem cells. This method allows a relative comparison between samples. The expression value of each gene was normalized to the relative amount of GAPDH expressed in that sample to calculate a relative amount of transcript present for each gene. The normalized expression values for all control and treated samples were averaged and an average fold change determined. A Student's t test was conducted between the normalized relative expression values for each individual control and treated samples to determine statistical relevance.

[0155] 7.1.2 Results

[0156] Four sets of microarray hybridizations to evaluate the effects of acute or chronic ECS treatment on gene expression in the frontal cortex and hippocampus. Plots of the mean abundances of each probe set measured with the Affymetrix U34A rodent chip are illustrated in FIGS. 1A-1D the frontal cortex (FIGS. 1A and 1B) and hippocampus (FIGS. 1C and 1D) of rats that with exposed to either acute (FIGS. 1A and 1C) or chronic (FIGS. 1B and 1D) ECS, compared to sham control groups that received no ECS. At gene abundances above 100, each scatter plot of mean intensities diplayed minimal skewing about the unity line. The distribution of points was generally quite close to the unity line, and statistically significant outliers (illustrated by white dots in FIGS. 1A-1D) were frequently well outside the main clusters of genes. Principal components analysis (PCS) revealeded that the frontal cortex and hippocampus transcriptome segregated into distinct clusters similar to that reported for different human CNS brain regions (Palfreyman et al., Current Drug Targets 2002, 1:205-216). Outlier samples in the PCA analysis could be readily distinguished, and either one sample, two samples or no sample group were removed from the statistical analysis so that the final n was 8-10 per group.

[0157] Data quality evaluation. In order to first determine the number of samples per group needed to reliably identify differentially expressed genes, subgroups of 2 to 10 samples were randomly created from the sham- or ECS-treated groups (or from these two groups combined) of gene expression data in the acute hippocampus. Welch t tests were performed for all gene changes comparing subgroups within or between the sham control and ECS groups as well as between “mixed” subgroups, and this was repeated for 100 different comparisons. Each time, the number of “hits” passing p value cut-off of 0.01 was counted, and these data are plotted in FIG. 2. Statistical theory predicted that about 40 genes are expected to appear as hits due to random technical and biological variations from the 3,500 to 4,400 detected genes, independent of any real ECS treatment effect. Comparisons between subgroups of samples drawn from within a group of animals or from the “mixed” animals produced numbers of significant gene changes that were mostly below this theoretical chance level, and gradually approached it with increases in subgroup size, as can be seen from visual inspection of FIG. 2. In contrast, the number of significantly changed genes in the analysis of sham-versus ECS-treated animals increased steeply from chance level at group size of 2, to 15-fold above chance at group sizes of 10. This analysis demonstrates that there is actually a highly significant effect of the ECS shock treatment on gene expression measured by the microarrays. The analysis also demonstrates that, while greater numbers of animal per group increase statistical power, only 8-10 animals per group are necessary to reliably detect large numbers of changes in gene expression levels.

[0158] Identification and ranking of differentially expressed genes. A total of 135 unique genes, represented by more than 200 probe sets, were found to be differentially expressed in ECS-treated rats compared to controls in at least one of the four different ECS experiments. These genes are listed in the Appendix, infra (see, in particular, at Section 8.3) along with the Accession number for their nucleotide sequence on the GenBank database. The Table in Section 8.3 also provides the expression ratio and p-value determined for each of these genes in the microarray analysis, as well as the “score” determined using the algorithm set forth at Table 1, supra. Many of these genes, whose names are shaded in the table at Section 8.2, have not been previously associated with either ECS or ECT.

[0159] A Venn diagram representing the distribution of changes among the 135 genes in different tissues and ECS treatment groups is shown in FIG. 3. Forty-four genes were identified that had significant differential expression in multiple tissues and/or in multiple ECS treatment groups. The most dramatic response to ECS treatment occurred in the hippocampus, where 87 genes were detected as differentially regulated following a single, acute shock. Chronic ECS treatment resulted in fewer gene changes in the hippocampus (64 compared to 87), about half of which were distinct from those that changes after a single ECS. The frontal cortex was much less responsive than the hippocampus, with 36 genes identified as differentially regulated following acute treatment, and 36 genes identified as differentially regulated by chronic ECS treatment. Twelve genes that were regulated in the same direction in the hippocampus by ECS and following exercise (Molteni et al., Eur. J. of Neurosci. 2002, 16:107-116; Tong et al., Neurobio. of Disease 2001, 8:1046-1056) are indicated in Section 8.3 by an asterix (*) next to the gene name. A supplemental list of genes with p values less than 0.005 and having differential expressing changes less than 1.5 fold was also identified. These genes are listed separately, infra, in Section 8.4 infra. As with the genes listed in Section 8.3, many of the genes listed in Section 8.4 have not been previously associated with either ECS or ECT. These genes' names are shaded in the tables at Sections 8.3-8.4.

[0160] Many genes are represented by more than one probe set on the Affymetrix RG-34A. chip. Among genes exhibiting significant change due to ECS treatment, 13 were identified by multiple probe sets. These genes are indicated in Tables of Sections 8.3 and 8.4 by italicized type in those tables. Five of these thirteen genes were selected to determine if all of the probe sets for each gene were in agreement. The numbers of probe sets reporting significant and co-directional change due to ECS treatments were as follows: five out of seven for BDNF; two out of two for Cox-2; two out of four for Jun, two out of three for TIEG, and 1-2 out of four (depending on the treatment and tissue) for catalase. The remaining five probe sets for BDNF, Jun and TIEG, as well as one of the catalase probe sets, produced only absent calls and/or less than 100 abundance units in all rat brain samples. These findings suggests that some of these probe sets simply failed to measure their target mRNA, and that ECS actually did affect at least four out of these five genes.

[0161] Twenty-six of the identified genes in Section 8.3 were found to be similarly up- or down-regulated in both the hippocampus and frontal cortex (FIG. 3), indicating that many brain-wide changes following shock therapy may occur independent of cellular heterogeneity of these different regions. In addition, 39 gene were found to change after both acute and chronic ECS treatments. Of the genes that were either up- or down-regulated in more than one tissue type and/or after both acute and chronic ECS therapies, one gene changed in different directions. This gene, solute carrier number 3, member 1, decreased expression in both the frontal cortex and hippocampus following chronic ECS, but increased expression in the hippocampus after acute ECS. The expression of five genes in particular was similarly affected by ECS in every experiment (labeled as “all” in the Table of Section 8.3)—i.e., following both chronic and acute ECS treatment in both the hippocampus and frontal cortex. These five genes are: brain derived neurotrophic factor (BDNF), prostaglandin-endoperoxide synthase 2 (COX-2), neuronal activity-regulated pentraxin (Narp), TGFβ-inducible early growth response, and tissue inhibitor of metalloproteinase 1 (TIMP-1). Each of these genes or their biological pathways have been previously reported to change individually following either ECS treatment or seizure in rats (see, in particular, Zetterstrom et al., Brain Res. Mol. Brain Res. 1998, 57:106-110; Nibuya et al., J. Neurosci. 1995, 15:7539-7547; and Hashimoto, Brain Res. 1998, 804:212-223). However, the combined changes in expression of these genes has not heretofore been demonstrated.

[0162] About two-thirds (i.e., 92 out of 135) of genes listed in Section 8.3 are changed their expression in only a single experiment. Hence, acute and chronic ECS can influence gene expression in different ways, and genes in the hippocampus and frontal cortex often respond differently to ECS treatment.

[0163] Verification of gene signatures by RT-PCR. The measurement of expression levels for a large number of genes can result in a significant number of false positive results (Zhang, Proc. Nat. Acad. Sci. U.S.A. 2002, 99:12509-12511; Zhou et al., Proc. Nat. Acad. Sci. U.S.A. 2000, 99:12783-12788; and Xu & Li, Bioinformatics 2003, 19:1284-1289). To confirm the validity of gene expression changes in these experiments, the expression levels of sixty genes were retested using quantitative real time polymerase chain reaction (RT-PCR) using an independent sampling of the RNA samples in the microarray analysis. Approximately 1,200 RT-PCR assays were conducted with hippocampal or frontal cortex tissues after actue or chronic ECS, focussing on those genes with the largest fold change and the best statistical signficance in the microarray analysis. Genes were also selected for confirmation by RT-PCR to provide a representation of various protein classes and for their biological relevance to depression or its treatment. The results of this analysis are presented infra, in Section 8.5 of the appendix. Genes were identified with a liberal p<0.05 criteria but required at least a 1.5-fold increase or decrease to be considered significant. The abbreviations “HCA” and “HCC” refer to genes identified in the hippocampus following either acute or chronic ECS treatment, respectively. The abbreviation FCA indicates that a gene was identified in the prefontal cortex following acute ECS treatment.

[0164] A “first pass” validation of the microarray results indicated a 50% validation rate. Principal failures in these experiments were apparently mostly caused by low technical performance of individual primer pairs, as evidence by a failure of the amplified signal to reach asymptotic intensity levels characteristic of genes that are efficiently amplified. For those primer pairs that failed, a second and unique primer pair was selected from the same region of the transcript to generate an amplicon the same length as the original primer pair. Non-validated genes were retested with the new primer pairs.80% of these RT-PCR retested genes produced the expected asymptotic signal across their amplification cycles, and validated results obtained in the microarray analysis.

[0165] The RT-PCR experiments confirmed statistically significant differential expression for 43 out of the 60 genes tested (i.e., 63%). In many instances, the average magnitude of the change in expression for a given gene was only 30% between the two groups. However, individuals within those different two groups would often differ by a factor of two or more. Hence, the RT-PCR procedure was used to quantify differences between individuals that exceed differences between the different experimental groups.

[0166] RT-PCR produces a false negative rate of approximately 30% for any large set of genes that has been found to be significantly changed on a microarray platform. Only a few genes exhibited high, asymptotic intensity levels during RT-PCR while, at the same time, failing to replicate microarray results because of low statistic significance. However, these nevertheless displayed a magnitude and direction of gene abundance change that were similar to those seen with the microarrays. Therefore, it is more likely that greater than 63%, and possible closer to 80% of the genes detected by microarray analysis of ECS tissues (i.e., in Sections 8.3 and 8.4) represent true positives based on RT-PCR validation. Repeated observations of expression changes within different tissues for these genes, confirmation of the magnitude and direction of expression changes for 63 to 80% of these genes by RT-PCR, and the substantial agreement of many observed changes with other literature findings all confirm that the results reported here are valid.

[0167] 7.2. Selection of Genes for ECS Gene Signatures

[0168] To select differentially expressed genes that are most informative and therefore useful in an ECS signature, e.g., for MPHTS type screening assays of the invention, the genes listed in Sections 8.3 and 8.4, infra, were prioritized for overall significance using the algorithm set forth in Section 8.2. This algorithm is based on statistical, biological and experimental considerations that account for factors such as a gene's known relationship (if any) with bipolar affective disorder and/or unipolar major depression—as well as accounting for the magnitude and/or statistical significance (e.g., the p-value) of the gene's differential expression in one or more ECS experiments. The algorithm minimizes biases (for example, favoring genes that change in one particular region or tissue) that are likely to occur when other methods of selection are used. As such, algorithms such as in the one in Section 8.2, infra, provide an objective and preferable way to identify informative genes that can be used in an ECS gene signature.

[0169] For each gene listed in Sections 8.3-8.4, a numerical value was assigned for the three aspects listed in Section 8.2—i.e., for statistical considerations (I), for biological considerations (II) and for experimental considerations (III). The numerical value(s) for each consideration were computed as described in Section 8.2. For biological considerations (II) a numerical value was assigned for each gene based on reports from the literature (if available).

[0170] The algorithm score thus calculated for each gene is set forth in the far right-hand column of the tables at Sections 8.3-8.4. 18 out of the 20 genes most highly ranked by the algorithm have been identified by prior investigators to change after ECS, whereas none of the 30 most lowly ranked genes have been previously associated with ECS. Yet, prior knowledge about a gene's change in response to ECS accounts for only about 2% of the algorithm's total score, demonstrating that the algorithm is able to independently prioritize genes according to their known response in ECS. In contrast, ranking genes according to only the level of their expression change and/or its p-value produces a more random distribution of the genes that have been previously associated with ECS.

[0171] To better identify differentially expressed genes that are particularly preferable, e.g., for use in an ECS gene signature, a biological consideration score (II) was recalculated for each gene—this time using experimental data from homologous genes in human cells and/or tissues. Specifically, expression levels of homologous genes were obtained from “disease signatures” in human tissue samples that had been obtained from individuals diagnosed with a neuropsychiatric disorder. In additiona, expression levels of homologous genes in “drug signatures” from neuronal cells or animals treated with neuropsychiatric drugs (e.g., valproate, clozapine, carbamazepine, and lithium) were also considered. Such “drug signatures” and “disease signatures” have been previously described, as have routine methods and techniques by which they can be obtained. See, in particular, U.S. patent application Publication No. 2003/096264 A1.

[0172] Homologos and/or orthologs of the rat genes listed in the appendices, infra, can be readily identified, for example by their level of sequence identity to the recited rat nucleic acid sequences, or by the level of sequence identity and/or homology to amino acid sequences that they encode. Alternatively, homologs and orthologs (including those from other species, such as humans and mice) can be identified by hybridization under conditions of appropriate stringency, such as those described, supra, in Section 6.1. In a preferred embodiment, appropriate homologs and/or orthologs are identified by routine clustering algorithms. For example, the NCBI UniGene database (accessed via the URL <http://www.ncbi.nlm.nih.gov/UniGene>) groups genes into appropriate clusters of homologous sequences from the same and/or different species of organism. See, Schuler, J. Mol. Med. 1997, 75(10):694-698; Schuler et al., Science 1996, 274:540-546; and Boyuski & Schuler, Nature Genetics 1995, 10:369-371.

[0173] Section 8.6 below provides a list of genes that are differentially expressed in the ECS experiments (Section 7.1, supra) along with the rat, human and mouse UniGene cluster number for each gene and its homologs. The GenBank Accession number for a representative nucleotide sequence (or fragment thereof) for each gene is also provided. Overlapping expression data for each gene in human tissue, human cells, and mouse/rat tissue is also provided, where available. The column “PGI II” gives the algorithm score for biological considerations (II) calculated using this additional information. The next column (“Groups I, II (Literature) and III”) provides algorithm scores computed using biological consideration information from the literature—i.e., the algorithm scores set forth in preceding Sections 8.3 and 8.4. The far right-hand column in Section 8.5 gives the “ECS algorithm score”—i.e., the sum of the original score and PGI II.

[0174] The actual number of ECS signature genes used in a particular application (i.e., for a particular “gene signature) may vary and generally depends on the need for additional information afforded by including more signature genes, balanced by considerations such as the cost and/or effort involved in addition additional signature genes, e.g., in a microarray or PCR assay platform. As an example preferred gene signatures of the invention comprise nucleic acids corresponding to five or more of the genes set forth in Sections 8.3-8.5, infra. In other embodiments, a gene signature of the invention can comprise nucleic acids corresponding to and/or capable of detecting at least 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150 or more of the signature genes set forth in Sections 8.3-8.5, infra.

[0175] Preferably, signature genes that are most informative are used. Such genes can be identified, for example, by using an algorithm such as the one set forth in Section 8.2, infra, or another objective algorithm that objectively ranks genes for use in an ECS signature. The table set forth at Section 8.5 below, which lists signature genes according to their ECS algorithm score, is therefore particularly useful for selecting useful and preferred genes in an ECS gene signature. Hence, as an example, preferred gene sgiantures of this invention comprise nucleic acids that correspond to and/or are capable of detecting at least the first 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 1120, 120, 130, 140, 150 or all of the signature genes list in Sections 8.5—i.e., nucleic acids corresponding to and/or capable of detecting at least SEQ ID NOS:1-5; 1-10; 1-15; 1-20; 1-25; 1-30; 1-35; 1-40; 1-45; 1-50; 1-60; 1-70; 1-80; 1-90; 1-100; 1-110; 1-120; 1-130; 1-140; 1-150; or 1-152.

[0176] Alternatively, a gene signature of the invention may comprise nucleic acids that correspond to and/or are capable of detecting orthologs and/or homologs of these genes. As noted above, homologos and/or orthologs of the rat genes listed in the appendices, infra, can be readily identified, for example by their level of sequence identity to the recited rat nucleic acid sequences, or by the level of sequence identity and/or homology to amino acid sequences that they encode. Alternatively, homologs and orthologs (including those from other species, such as humans and mice) can be identified by hybridization under conditions of appropriate stringency, such as those described, supra, in Section 6.1. In a preferred embodiment, appropriate homologs and/or orthologs are identified by routine clustering algorithms. For example, the NCBI UniGene database (accessed via the URL <http://www.ncbi.nlm.nih.gov/UniGene>) groups genes into appropriate clusters of homologous sequences from the same and/or different species of organism. See, Schuler, J. Mol Med. 1997, 75(10):694-698; Schuler et al., Science 1996, 274:540-546; and Boyuski & Schuler, Nature Genetics 1995, 10:369-371.

[0177] As a particular example, and not by way of limitation, the gene signature of the invention can be used, e.g., in detection kits (for example in any of the applications described in Sections 7.3-7.5, below) for detecting one or more signature genes. Examples of such kits include, but are not limited to, microarrays and PCR reaction kits to name a few. Preferred kits will comprise probes and/or primers for amplifying and/or detecting one or more ECS signature genes. Hence, a typical kit of the invention will comprises nucleic acids (preferably oligonucleotides) that specifically hybridize to ECS signature genes of the invention under defined conditions (which will depend on the particular application for which the kit is used, and include hybridization conditions described in this specification supra).

[0178] Kits of the invention will typically comprise at least one probe or primer which is capable of specifically hybridizing (e.g., understringent conditions) to an ECS signature gene. Preferred kits comprises a plurality of probes or primers, each capable of specifically hybridizing to a different ECS signature gene. PCR reaction kits will typically comprise a at least one pair of primer (i.e., a primer pair) that is capable of specifically amplifying an ECS signature gene. Hence, a primer pair in such kits will typically include a “forward” primer and a “reverse” primer. Preferred PCR kits of the invention comprise a plurality of primer pairs, each of which is capable of amplifying a particular ECS signature gene by PCR. PCR kits of the invention also include reverse transcription PCR (RT-PCR) kits, that are capable of converting mRNA from ECS signature genes into cDNA, and then amplifying said cDNA in a PCR reaction. Hence, RT-PCR kits of the invention will also typically comprise at least one primer pair, and may also comprise a “reverse primer;” i.e., an oligonucleotide that is capable of priming the reverse transcription of mRNA from an ECS signature gene. Preferred RT-PCR kits comprise a plurality of primer pairs and reverse primers, which are capable of reverse transcribing and amplifying mRNAs for a plurality of different ECS signature genes.

[0179] In some embodiments, the reverse primer is a separate and distinct oligonucleotide that is different from either the forward or reverse primer. More preferably, however, either the forward or reverse primer for amplifying an ECS signature gene is also capable of reverse transcribing mRNA of that ECS signature gene, and so also functions as a reverse transcription primer in an RT-PCR kit.

[0180] Kits may contain additional reagents or other components for detecting ECS signature genes, or such additional components may be provided by a user. For example, preferred kits of the invention may contain at least a polymerase enzyme and one or more labeled nucleotides. A kit of the invention may also contain buffer or other reagents.

[0181] 7.3. Evaluating Patient Treatment Using ECS Gene Signatures

[0182] The ECS gene signatures identified herein (e.g., in Sections 8.3-8.5, infra) can be used in evaluating the efficacy of a treatment of a patient suffering from depression. In this evaluation, a peripheral blood sample is taken from the patient before the treatment, and at 1 day or 1 week after initiating the treatment. Lymphocytes or other accessible peripheral cell types are isolated, and the level of expression of all genes within these cells, or a subset of genes indicated in Tables 5-7, is then determined by one or more quantitative RNA measuring techniques. For example, the sample can be treated with Trizol reagent and other chemical procedures in order to prepare RNA. Probes and microarrays for the analysis are prepared using methods established for gene expression analysis, such as by any one of the commonly available quantitative mRNA detection methods, e.g., reverse transcriptase-polymerase chain reaction (RT-PCR) or microarray analysis using commercially available chips. The alteration of gene expression is quantified in the lymphocyte sample(s) and determined to be either higher or lower than the value obtained in the pre-treatment, baseline sample(s).

[0183] Resulting data showing gene changes in the same direction as those obtained in the ECS samples sets indicates indicate that the treatment method is effective in treating depression in the patient, whereas a gene change in the opposite direction indicates that the treatment is less effective for this purpose.

[0184] 7.4. Use Of ECS Gene Signatures To Identify Therapeutic Compounds

[0185] The gene signatures provided in Sections 8.3-8.5, infra, can be used in screening for antidepressive agents. In this screening method, a set of genes is selected as those which change significantly following ECS treatments (see Tables 5-7), and the corresponding probes are selected by having sequence identity with a significant region of the selected genes. Methods and procedures that are common to those skilled in the art of quantifying RNA levels in cells, using, e.g., reverse transcriptase-polymerase chain reaction (RT-PCR), microarray analysis with commercially, high-density cDNA or oligonucleotide-based chips, or miniarrays that incorporate relatively small (e.g., 10-300) gene sequences corresponding to the genes listed in Sections 8.3-8.5, are used. Microplates wherein each well contains a fixed number of cultured neuronal cells from a neuronal, neuroblastoma, astrocytic, or glial cell line, or a primary neuronal or lymphoblast culture, or another suitable cell culture, are prepared and used as described in the co-pending U.S. patent application Ser. No. 10/175,523, filed Jun. 18, 2002.

[0186] The assay is conducted by contacting neuronal cells with a number of test agents, exposing the cells to the test agents for 1-4 days, and the effect on the expression of the select set of genes is evaluated. Resulting data showing gene changes that are in the same direction as those identified following ECS exposure to rats, particularly those summarized in Tables 5-7, for a test agent shows that the agent can be effective in treating or preventing depression.

[0187] 7.5. Diagnostic Uses of ECS Gene Signatures

[0188] The gene signatures identified in Section 8.3-8.5, infra, can also be used for the prognosis or diagnosis of a patient suspected to be suffering from, or at risk for developing, a depression.

[0189] In this evaluation, samples of blood or other accessible tissues are obtained before, and at 1, 2, 4 and additional weeks after, treatment. Circulating lymphocytes (white blood cells) or are obtained from the samples and subjected to gene expression analysis using any one of the commonly available quantititaive mRNA detection methods, such as reverse transcriptase-polymerase chain reaction (RT-PCR), microarray analysis using commercially available chips, or other methods. The alteration of gene expression is quantified in post-treatment sample(s) and determined to be either higher or lower than the value obtained in the pre-treatment, baseline sample(s).

[0190] Resulting data showing changes in gene expression that are in the opposite direction as gene changes associated with ECS as listed in Tables 5-7 indicates that the patient is suffering from depression or is at risk for developing a depression, whereas the change in the same direction, or absence of a change, indicates a lower risk for depression. In response to an putative antidepressant agent (to determine prognosis of disease following initial baseline assessments), changes in gene expression will have prognostic value as well. In this example of the embodiment of the invention, resulting data that showed changes in gene expression that are in the same direction as gene changes associated with ECS as listed in Tables 5-7 indicate that the therapeutic agent is improving the patients state of depression, whereas the change in the opposite direction, or absence of a change indicates a lower effectiveness of the antidepressant potential of the test agent.

[0191] 8. Appendix

[0192] 8.1. Electroconvulsive Shock (ECS) Therapy and Gene Regulation (Literature Review) Post final ECS shock tissue Author, YR level/dose collection Altered Genes and Brain region Kim, et al., Mol Cells 2001, Single shock 3-24 hours Decreased Inositol 1,4,5- 12: 173-177 (hrs) triphosphate receptor 1 (InsP3R1) mRNA in dentate gyrus and CA1 Jensen et al., J 14 days Decreased POMC mRNA in the Neuroendocrinol 2001, Hypothalamic-pitutitary-adrenal 13: 887-893. (HPA) axis Jang et al., Prog Chronic Adenylate cyclase (AC) isoforms Neuropsychopharmacol Biol in hippocampus and cerebellum Psychiatry 2001, 25: 1571-81 Kondratyev et al., Brain Res Low- ECS treatment prevented elevation Mol Brain Res 2001, 91: 1-13. sensitivity of Bcl-XS mRNA after prolonged (minimal) for seizure activity in hippocampus 7 days and rhinal cortex Koubi et al., Brain Res Acute shock 72 hrs Decreased TH and TpOH protein 2001, 905: 161-170 levels in Locus Cereuleus (LC), Central tegmental area and Raphe Centralis (TpOH only) mRNA changes only in LC; increase TpOH protein levels in Frontal cortex and increase TH protein in hippocampus Shen et al., Psychiatry Clin Acute/ 9-24 hrs/ Decreased Serotonin transporter Neurosci 2001, 55: 75-77 chronic 1 hr-2 (5-HTT) mRNA in raphe nucleus weeks Burnet et al., Eur J Single shock/ Increased Substance P (tachykinin Pharmacol 2001, 413: 213-219 5 shocks on NK(1)) receptor binding site alternate days density, but no mRNA changes in cerebral cortex Chen et al., Synapse 2001, Acute  2 hrs Increased GRFalpha-1 and 39: 42-50. Chronic GFRalpha-2 mRNA in dentate gyrus Cho et al., Exp Mol Med Chronic Differential changes in PDE 2000, 32: 110-114 isoform mRNA in hippocampus and striatum Lammers et al., Mol 10 days Increased D3 receptor mRNA Psychiatry 2000, 5: 348-388 Shell of the nucleus accumbens Madsen et al., Neuroscience 14 days 24 hrs Increased Neuropeptide Y receptor 2000, 98: 33-39 Y1 and Y2 mRNA in CA1 and CA3 Husum et al., 10 days Increased preproNPY mRNA in Neuropharmacology 2000, the CA1 and dentate gyrus 39: 1463-73 Valentine et al., Brain Res  6 hrs Increase Fragile X (fmr1) mRNA Mol Brain Res 2000, in dentate gyrus granule cell layer 75: 337-341 Burnet et al., Neurosci Lett Single, Immediately Decreased 5-HT1A receptor 1999, 277: 79-82 repeated, and 3 weeks mRNA in CA4 Hippocampus and after last increased in dentate gyrus by shock single or repeated ECS. Repeated ECS increased cortical 5-HT2A receptor mRNA Pei et al., Neuroscience Single shock/ Single shock = Transient reduction 1999, 90: 621-627 5 shocks over of GIRK1 mRNA in dentate gyrus 10 days Repeated shock = increase in GIRK2 and GIRK1 mRNA in dentate gyrus Takahashi et al., J Neurosci Increased Phosphodiesterase (PDE) 1999, 19: 610-618 4 subtypes PDE4A, PDE4B mRNA in prefrontal cortex Watkins et al., Brain Res Single and Levels Dentate gyrus: increased mRNA Mol Brain Res 1998, 61: 108-113 repeated return to for NMDA subunits NR2A and control NR2B; reduced mRNA for values after metabotropic glutamate receptor 48 hrs mGLU5b Suda et al., J Neurochem Acute and Increase in Phosphodiesterase type 1998, 71: 1554-1563 repeated IV (PDE IV) mRNA in various brain regions after acute shock; repeated shock increased PDE IV in frontal cortex Zetterstrom et al., Brain Res Acute and 6-48 hrs BDNF increased with both acute Mol Brain Res 1998, 57: 106-110 chronic and chronic, both evident at 6 hours, chronic shock effete evident at 48 hours, but acute shock changes gone at 24 hr Garcia-Garcia et al. Acute and Corticotropin releasing factor Neuroreport 1998, 9: 73-77 chronic (CRF) increased in paraventricular nucleus; propiomelanocortin (POMC) increased in arcuate nucleus and ventromedial nucleus and Proenkephalin (PENK) in nucleus accumbens Pei et al., Neuroreport 5 times over 6-24 hrs Increased mRNA MAP2 in dentate 1998, 9: 391-394 10 days gyrus O'Donovan et al., J .5-1 hr and Increased Erg-1 mRNA in dentate Neurochem 1998, 70: 1241-1248 4 hrs granule cells returned to normal by 4 hr Increased Erg-3 peak at 4 hr Xing et al., Brain Res Mol Single 15 min-4 hrs Immediate increase in Brain Res 1997, 47: 251-261 Nurr1mRNA in dentate gyrus granule cells, peak at 1 hr and return to normal by 4 hr Wang et al., Neurosci Lett Increased Preprotachykinin-A 1996, 226: 151-154 (PPT-A) mRNA in CSF and aqueous humor of eye in RABBIT Pei et al., Neuroscience Acute and 6 hrs, 24 hrs Acute: reduced voltage-dependent 1997, 78: 343-350 chronic and 3 weeks potassium channel subunits Kv1.2 and Kv4.2 at 6 hrs and normal at 24 hrs in dentate gyrus Chronic: Kv1.2 at 6 hrs and normal at 24 hrs in dentate gyrus Kv4.2 no change at 6 hr elevated at 24 hr Zachrisson et al., Eur J Decrease in preprotachykinin-A Pharmacol 1997, 319: 191-195 mRNA in caudate -putamen and decrease in tachykinin NK1 receptor mRNA positive neurons in Nucleus Accumbens Zachrisson et al., Brain Res 6 shocks in 2 Decrease in CCK mRNA in Mol Brain Res 1997, 43: 347-350 weeks caudate-putamen Jung et al., Biol Psychiatry Single and 20 30 min-90 min Varying effects in tetradecanoyl 1996, 40: 503-507 days phorbol acetate-inducible sequences (TIS) 1, 7, 8, 1, 21 in varying brain regions McGowan et al., Eur J 5 shocks over G protein subunits Gs alpha Pharmacol 1996, 306: 249-55 5 days mRNA decreased in CA3 and CA1, G9(o) alpha mRNA increased in dentate gyrus and Gi2 alpha mRNA reduced in dentate gyrus an CA3 Woldbye et al., Brain Res Single shock Various Prominent induction of c-fos 1996, 720: 111-119 times up to mRNA and FOS in dentate gyrus 24 hrs Porter et al., Brain Res Single and 5 Acute: reduced KA1 mRNA in 1996, 710: 97-102 shocks over CA3a increased KA2 MRNA in 10 days dentate gyrus. Repeated shock: reduced KA1 mRNA in CA3c and CA3c and CA3a-b; KA2 mRNA increased in dentate gyrus All changes back to normal at 3 weeks post shock Naylor et al., Brain Res Mol Repeated 24 hrs Increased mRNA for GluR1 Brain Res 1996, 35: 349-353 shock subunit of AMPA receptor Fitzgerald et al., J Acute Increased ICER and CREM Neurochem 1996, 66: 429-432 isoforms in hippocampus, frontal cortex and cerebellum Chen, et al., Mol Acute and FosB and FosB-like proteins Pharmacol 1995, 48: 880-889 Chronic Nibuya et al, J Neurosci Acute (1 d) Acute: increased BDNF mRNA 1995, 15: 7539-7547 and Chronic recovered by 18 hr Chronic: (21 d) decreased induction of BDNF and trkB mRNA, but prolonged their expression in dentate gyrus; in CA1 and CA3, elevated induction of BDNF and trkB Mrna Smith et al., Acute & 4 hrs and 24 hrs Acute and repeated: increased D1 Psychopharmacology 1995, repeated (5 and D2 receptor mRNA at 4 hours 120: 333-340 shocks in 10 d) but NOT 24 hours in Nucleus Accumbens Burnet et al., Neuroreport 5 shocks in 10 Increased 5-HT1A and 5-HT2A 1995, 6: 901-904 days receptor mRNA in dentate gyrus and neocortex and decreaed 5- HT1A in CA3c Lindefors et al., 1, 3, 9, 24 hrs Brain derived neurotrophic factor Neuroscience 1995, 65: 661-670 mRNA and trkB mRNA increased in hippocampus Dziedzicka-Wasylewska & Increased proenkephalin mRNA in Rogoz, J Neural Transm nucleus accumbens Gen Sect 1995, 102: 221-228 Kim et al., J Neurochem Single 6 hrs & 12 hrs 6 hrs: decreased Inositol 1,4,5- 1994, 63: 1991-1994 triphosphate (InsP3) 3-kinase mRNA in dentate gyrus returned to normal at 12 hours Brady et al., J Clin Invest Single and 24 hrs-8 wk Single: Increased Corticotropin- 1994, 94: 1263-1268 repeated releasing hormone (CRH) mRNA in the paraventricular nucleus and TH mRNA in the locus coeruleus at 24 hrs Repeated: some increase that persisted for up to 3 weeks (CRV) and up to 8 weeks (TH) Passarelli et al., Neurosci Single and 2 hrs Increased expression of hcs73 in Lett 1994, 177: 147-150 repeated (7 d) dentate gyrus, CA3 and median habenular nucleus; induced hsp70 mRNA in dentate gyrus (same in single and repeated shock) Mikkelsen et al., Brain Res 14 days 24 hrs 10 × increase in PreproNPY Mol Brain Res 1994, 23: 317-322 mRNA in dentate gyrus and piriform cortex Follesa et al., Exp Neurol Corneal shock Min: bFGF mRNA increased in 1994, 127: 37-44 Minimal entorhinal cortex and hippocampus (0.2 s, 50-70 mA Max: increased in bFGF mRNA 3x in in entorhinal cortex, hippocampus, 1 h) and olfactory bulb, striatum and Maximal cerebellum, increased NGF mRNA (0.2 s, 150 mA in entorhinal cortex and 3x in 1 h) hippocampus Kragh et al., Exp Brain Res 10 shocks, 20 1 d, 2 d, 30 d Increased SS-like and NYP-like 1994, 98: 305-13 shocks, or 36 immunoreactivity 1 and 2 d after shocks given last shock in outer part of dentate on weekdays molecular layer Most pronounced only in 36 shock group Butler et al., J Neurochem Acute (1 d or Chronic: increased 5-HT2 1993, 61: 1270-1276 3 d) Chronic receptor mRNA (10 d or 14 d) Passarelli and Orzi Neurosci Repeated (7 d) Repeated: Increased expression of Lett 1993, 153: 197-201 and single somatostantin mRNA in hippocampus Hosoda and Duman, J 10 d Down-regulation of beta 1AR Neurochem 1993, 60: 1335-1343 mRNA in frontal cortex Kapu et al., Brain Res Mol Increased TH mRNA and NPY Brain Res 1993, 18: 121-126 mRNA in Locus Coeruleus Pratt et al., Brain Res Bull 4-24 hrs Increased GABAa receptor 1993, 30: 691-693 subunits alpha 1 and beta 2 mRNA in cerebellum at 4-8 hours, returned to normal at 24 hrs Wong et al., Brain Res Mol Induction of the constitutive hsp72 Brain Res 1992, 13: 19-25 gene in dentate gyrus Lindefors et al., 5 shocks in 10 Increased preprocholecystokinin Neuroscience 1991, 45: 73-80 days and preprotachykinin-A mRNA in Edinger-Westphal nucleus Kang et al., Single 2-8 hrs Increased Alpha 1 and gamma2 Psychopharmacol Bull 1991, GABA receptor subunit mRNA in 27: 359-363 cerebellum and hippocampus not at 2 hrs, increased at 4 hours returned to normal at 8 hours in hippocampus Cole et al., . J Neurochem Single and 15 min. and 15 min: increased zif/268, c-fos, 1990, 55: 1920-1927 repeated 4 hrs c-jun and jun-B in hippocampus and neocortex and pyriform cortex; all except c-jun returned to normal by 4 hours Herman, et al., Brain Res 7 days Increased CRF mRNA in medial 1989, 501: 235-246 parvocellular paraventricular nucleus (PVN) Xei et al., Brain Res Mol Single and 0.5, 2, 6, 12 hrs Single: decrease in DYN mRNA Brain Res 1989, 6: 11-19 repeated (1, (single); at 0.5 hour followed by increase at 3, 6 days) 24 hrs 6 hrs in hippocampus; EK mRNA (repeated) increased at 0.5 hour and still high at 12 hrs in entorhinal cortex and hippocampus Repeated: decreased DYN mRNA in hippocampus at 24 hrs elevated DYN mRNA in striatum and hypothalamus. Increased EK mRNA in entorhinal cortex at 6 hrs Yoshikawa et al., ProcNatl 10 d Increased preproenkephalin mRNA Adad Sci USA 1985, 82: 589-593 in hypothalamus and striatum

[0193] 8.2. ECS Gene Selection Algorithm

[0194] Three considerations were quantified and summed to prioritize the significance of gene “hit” as defined in the statistical analysis portion of Section 7.1. 1, infra. I. Statistical considerations: (High score = 27). Score = p value Score + (0.5) Ratio Score + (0.1) Abundance Score The significance (p value), fold change (ratio), and gene abundance measured in optical intensity units were assigned an integer value as follows: p value Score = −log₁₀(p value) Ratio Score = Ratio if the ratio > 1.5 or 1/Ratio if ratio ≦ 1/1.5 Abundance Score = log₁₀ (Abundance/700) II. Biological considerations: Score = A (10) + B (8) + C (6) + D (5) + E (2) + F (4) (Max. score = 35) A. Gene change is in the opposite direction of mRNA or protein change in human   depression or bipolar disorder, or animal model reported in the literature. B. Same as A, but gene change is in the same direction. C. Gene is known to be changed by effective treatments for human depression or bipolar   disorder D. Gene is part of a genetic or biochemical pathway known to be associated with human   depression, bipolar disorder, or their treatment E. Gene has been reported to change following seizures in human or animal studies. F. Gene's human homologue is in a chromosomal hot spot for depression or bipolar disorder   as identified by linkage analysis. III. Experimental considerations: The experimental results score is based on changes in either or both tissues and after acute, chronic, or both ECS exposures (Max. score = 14) 14 Both tissues after acute and chronic 10 Two tissues, chronic and one tissue acute 8 Both tissues, chronic 6 Two tissues acute, one tissue chronic 4 One tissue chronic, one tissue acute 2 Two tissues, acute 1 One tisue chronic ECS Algorithm Score = 2.2(I) + II + III Maximum score possible = 2.2(27) + 35 + 14 = 108

[0195] 8.3. Fold Change (RATIO), Significance (p VALUE). and Ranking of Genes Identified as Altered in Hippocampus (H) and Frontal Cortex (F) after Either Actue (A) or Chronic (C) ECS Exposure Other Region regions with with Accession p biggest significant Algorithm Gene Name No. RATIO VALUE change change Score *cyclooxygenase isoform S67722 7.33 9E−08 HA All 73.9 (COX-2) putative potassium channel TWIK AF022819 1.83 9E−04 HC HA 14.5 Vascular endothelial growth factor AA850734 1.73 8E−05 HA FA 11.9 (VEGF) *brain-derived neurotrophic factor D10938 4.78 1E−03 HC All 74.8 (BDNF) (exon IV) metallothionein 1 A AI102562 2.04 2E−05 HA FA, HC 62.1 Phosphodiesterase 4B, cAMP- M25350 1.54 3E−05 FA FC 29.8 Specific fos-related antigen (fra-2) AA875032 3.16 8E−05 HA FA, HC 59.9 neuropeptide Y (NPY) M15880 2.14 1E−02 HC FC 54.0 HMG-CoA reductase X55286 1.48 3E−05 HC HA, HC 34.0 TGFB inducible early growth AI172476 4.68 1E−03 HC All 49.0 response (TIEG1) nuclear orphan receptor HZF-3 U01146 1.70 1E−05 HA HC 39.9 (NURR1) Jun B X54686 2.21 1E−07 FA HA 59.3 Jun AI175959 1.84 9E−05 HA FA, FC 58.8 *NMDAR1 glutamate receptor U11418 1.59 2E−02 HA 42.0 subunit *Tissue inhibitor of AI169327 11.39 2E−06 HA All 56.9 metalloproteinase 1 (TIMP1) *Narp S82649 5.89 7E−09 HC All 46.4 cytosolic phospholipase A2, group U38376 1.79 6E−07 HA HC 48.6 IVA silencer factor B (SF-B) X60769 1.76 2E−04 HA All 37.5 Minoxidil sulfotransferase L19998 0.504 4E−03 HC HA, FA 25.6 VGF nerve growth factor inducible M74223 2.35 8E−09 HA All 46.5 phosphatidylinositol 3-kinase p85 D64045 2.02 4E−04 FA 19.5 alpha subunit c-fos X06769 2.22 1E−05 FC HA, HC 44.9 Catalase AA926149 0.60 8E−04 HA 44.1 ceruloplasmin (ferroxidase) L33869 1.81 2E−05 FA HA, HC 33.4 *nerve growth factor-induced M92433 2.41 4E−07 HA HC, FA 42.5 clone C thyrotropin-releasing hormone M23643 2.01 1E−02 HC 42.4 glial fibrillary acidic protein delta; AF028784 1.769 2E−04 FA HA, HC 42.0 glial fibrillary acidic protein alpha HES-1 factor D13417 2.09 5E−03 HC All 26.7 activity and neurotransmitter- AF030089 2.58 6E−07 HA FC, HC 23.8 induced early gene protein 4 (ania- 4) Cytochrome P450 1b1 AI176856 2.247 9E−04 FA HA, HC 25.3 arginine vasopressin (Diabetes M25646 1.52 4E−03 HA FA 39.0 insipidus) enhancer-of-split and hairy-related AF009329 2.06 2E−03 HC All 32.5 protein 1 (SHARP 1) *Protein-tyrosine phosphatase AI180145 1.66 2E−02 HA 26.5 (Ptpn1, non-receptor type 1) Glutathione S-transferase, theta 2 AI138143 0.63 1E−02 HA FA 26.5 protein phosphatase 1, regulatory J05592 1.62 6E−03 HC FA, HA 22.7 (inhibitor) subunit 1A krox20; (early growth response 2) U78102 3.13 7E−04 FA HA, HC 36.2 interleukin 6 receptor M58587 1.74 3E−05 FA 22.2 S-adenosylmethionine AI008131 1.55 1E−02 HA FA 24.8 decarboxylase ets variant gene 3 AA859750 1.62 1E−05 HA All 28.5 *neural receptor protein-tyrosine M55291 1.52 2E−02 HA HC, FA 20.4 kinase (TrkB) *Vesl AB003726 6.50 2E−07 HA FA, HC 33.9 pyruvate dehydrogenase AF062740 1.83 2E−02 HC HA, FA 17.6 phosphatase isoenzyme 1 EST195957 (Mad4 homolog AA892154 0.67 1E−04 HA All 27.2 (MAD4)) MHC class 1b RT1.S3 AI235890 1.808 3E−03 HA HC 32.2 Growth factor receptor bound AI170776 1.68 6E−06 HA HC 32.0 protein 2 (GRB2) prostaglandin D synthase J04488 0.35 6E−03 HC FC 25.5 *metabotropic glutamate receptor M61099 0.60 3E−05 HA 25.5 (GluR1) Decorin AI639233 0.66 2E−02 HA 31.5 aldehyde dehydrogenase M73714 0.45 2E−05 HA 20.6 rx00909s Rattus norvegicus cDNA AI638960 0.62 3E−06 HA HC, FA 17.3 clone activin type I receptor L19341 1.63 3E−05 HC 7.2 growth factor (Arc) U19866 4.82 6E−09 HA FA, HC 29.0 Interferon-related developmental AI014163 2.03 6E−03 HC HA, FA 21.7 regulator 1 neuritin U88958 1.56 1E−08 HA FA, HC 27.7 myr 6 myosin heavy chain U60416 0.65 3E−05 HA HC 26.7 interferon gamma receptor (Ifngr) U68272 0.65 5E−04 FC 20.1 lost on transformation 1 (LOT1) U72620 1.58 8E−05 FA 6.0 DNAJ (Hsp40) homolog, subfamily AA891542 2.32 1E−07 HA HC, FA 25.7 B, member 5 JE product (small inducibe X17053 1.85 3E−03 FA HC, HA 25.4 cytokine A2) Activity and Neurotransmitter AF030088 4.787 4E−05 HC HA 24.9 induced early gene 3; (ania-3) Roaz U92564 0.65 5E−05 HA HC 14.3 3CH134/CL100 PTPase (protein S81478 1.67 8E−03 HA FA 24.3 tyrosine phosphatase, non-receptor 16) matrilysin (Matrix L24374 1.62 9E−03 HA 4.3 metalloproteinase 7) EST194857 AA891054 1.82 5E−03 FC All 24.0 tachykinin 2 (Neurokinin B M16410 3.36 2E−03 HC 22.1 precursor (NKB)) rx04104s (Neurod1) AI639109 0.42 2E−03 HA FA, HC 21.8 EST190175 (atrophin-1 related AA800678 0.64 9E−06 HA HC, FA 21.5 protein) solute carrier family 3, member 1 M80804 0.48 3E−02 FC FC, HA 11.4 flavin-containing monooxygenase 1 M84719 0.67 2E−03 FA HA 10.3 SPA-1 like protein AI237576 1.57 1E−02 HA 4.2 PP1M M110 S74907 1.76 3E−03 HC 19.9 leucine-rich acidic nuclear protein D32209 1.60 4E−02 FC 5.0 BHF-1 D82074 0.44 6E−04 HA FA, HC 19.0 rx01019s Rattus norvegicus cDNA AI639256 0.62 4E−04 HA HC 12.8 clone Nuclear receptor subfamily 4, AI176710 1.62 8E−04 FA HA 10.8 group A, member 3 Isovaleryl Coenzyme A AI102838 0.64 3E−03 HC 18.4 dehydrogenase insulin-like growth factor II X17012 0.44 5E−02 HC FC 17.8 (somatomedin A) GADD153 U30186 3.16 8E−05 HA 17.0 UI-R-E0-bs-h-03-0-UI.s1 UI-R-E0 AA859627 1.56 2E−02 HA 5.0 Rattus norvegicus cDNA clone Calvasculin; S100 calcium-binding X06916 1.58 3E−02 HC 16.8 protein A4 EST189184 AA799687 0.65 2E−03 HA 4.8 developmentally regulated protein AI012275 0.61 4E−04 HA HC 10.6 TPO1 UI-R-A0-bd-e-03-0-UI.s1 UI-R-A0 AA866485 1.53 1E−03 FA HA 10.0 Rattus norvegicus cDNA clone cyclic AMP-regulated S65091 2.09 7E−07 HA HC 15.8 phosphoprotein di-N-acetylchitobiase M95768 0.66 3E−05 HA 5.8 mothers against dpp 3 homolog U66479 1.78 3E−03 HA 15.7 EST225243 (S100 calcium binding AI228548 0.65 3E−02 HC 15.7 protein A1) CCAAT/enhancerbinding, protein M65149 2.95 2E−05 FA FC 15.4 (C/EBP) delta EST196314 (tescalcin) AA892511 1.53 1E−02 HC HA 9.1 EST220045 AI176460 7.81 3E−01 FC 15.0 Potassium voltage gated channel, AI230211 2.437 5E−02 HA FC 14.3 Shal-related family member 2 transducin-like enhancer of split AA875084 0.60 1E−06 HA FA 14.0 1(Tle1) *Fibroblast growth factor receptor 1 D12498 1.51 2E−03 HA 7.7 chemokine CX3C AF030358 1.59 1E−05 HA HC 12.9 neuropilin AF016296 1.67 2E−03 HA HC 12.8 EST197009 AA893206 1.63 4E−05 HA HC 12.8 EST190350 AA800853 1.55 2E−05 FA HA 12.4 EST213163 (FK506 binding AI103874 0.72 3E−04 HC HA 12.0 protein 3) DNA-damage-inducible transcript 1 L32591 1.68 2E−03 HC 5.8 thymosin beta-10 M58404 1.51 3E−04 HC 3.8 Y box protein 1 AI230572 1.53 7E−03 HA 3.6 phosducin-like protein L15354 0.56 6E−06 HA FA 11.3 Ash-s D49847 1.56 1E−03 HC HA 11.2 connexin protein Cx26 (gap X51615 0.66 1E−02 FA HC 11.2 junction beta-2 protein) Early growth response 1 (EGR1) AI176662 1.63 1E−03 HA FA 11.1 EST215655 (similar to G33) AI169756 1.78 3E−03 FA HA 10.7 rx04422s Rattus norvegicus cDNA AI639169 0.66 5E−04 HA 4.6 clone protein tyrosine phosphatase AJ007016 1.51 3E−03 HA FA 10.4 (PtP4a2) EST195538 AA891735 1.58 3E−04 HA HC 10.3 rx04826s Rattus norvegicus cDNA AI639058 1.62 2E−02 HC HA 10.0 clone (similar to Nedd4) UI-R-E0-ci-e-02-0-UI.s1 UI-R-E0 AA874943 0.64 3E−04 HA HC 9.2 Rattus norvegicus cDNA clone Transcription factor UBF M61725 1.58 8E−03 HA HC 8.8 EST203592 (chromobox homolog AI009141 0.63 3E−02 HC HA 8.7 1) Lysyl oxidase S77494 2.407 6E−03 HC 8.5 Small proline-rich protein gene AA891911 1.52 2E−02 HC 6.9 (cornifin alpha) EST196165 AA892362 0.65 3E−04 HA 6.7 EST106597 H31990 0.64 1E−03 HC 6.6 heme oxygenase-3 AF058787 1.61 9E−03 HC 6.5 EST197625 (RAD52 homolog) AA893822 0.64 9E−04 HA 6.3 protein kinase MNK2 Z21935 1.81 2E−02 HA 6.2 SHB (Src homology 2 domain AA859468 1.77 7E−03 HC HA 5.2 containing) adaptor protein B rx00382s Rattus norvegicus cDNA AI639155 0.56 3E−05 HA 5.2 clone UI-R-E0-bx-e-11-0-UI.s1 UI-R-E0 AA859690 1.94 3E−08 HC 4.7 Rattus norvegicus cDNA clone EST190198 (coagulation factor III) AA800701 1.58 2E−04 HA 4.4 EST190228 AA800731 0.61 7E−03 FC 4.1 EST197395 (reticulocalbin 2) AA893592 1.67 5E−02 HC 4.1 nuclear pore complex protein U41845 1.52 2E−04 HA 3.9 Nup50 *clathrin, light polypeptide (LCB) AA874955 1.62 4E−03 HA 3.8 NAC-1 protein AF015911 1.63 1E−02 HA 3.8 beta defensin-1 AF068860 1.24 2E−02 HA 3.7 voltage-dependent potassium X12589 1.83 4E−02 HA 3.5 channel; (shaker-related subfamily) rx00967s Rattus norvegicus cDNA AI639015 0.61 4E−04 HA 3.1 clone Beta-carotene 15, 15′-dioxygenase AI014135 0.59 2E−02 FA 2.9 BRL-3A binding protein A09811 0.32 1E−02 HC 2.9 potassium voltage gated channel, M32867 1.61 1E−02 HC 2.7 shaker related subfamily, member 4

[0196] 8.4. Supplemental List of Highly Specific Genes that Change Less than 1.5-Fold (Columns are as Defined in Section 8.3, supra) Other Region regions with with Accession p lowest p significant Algorithm Gene No. RATIO VALUE value change Score Bcl-2 associated death agonist AA818072 0.8 3E−05 HA HC 32 (BAD) beta-tubulin T beta 15 (aa 1-445) X03369 1.27 1E−05 HC 8 diacylglycerol kinase S49760 1.37 4E−04 HC 7 EST 189694 Normalized rat AA800197 0.69 4E−07 HA HC 11 heart EST 195383 Normalized rat AA891580 0.67 2E−05 HC HA 9 kidney EST 197783 Normalized rat AA893980 0.76 6E−06 HA HC 10 placenta GABA-A receptor X51992 0.79 2E−04 HC HA 19 Glypican L02896 1.28 3E−03 HC FA, FC 13 Growth factor receptor bound AA801130 1.23 5E−04 HC HA, FC 16 protein 2 Light molecular weight AF031880 1.4 1E−05 HA HC 24 neurofilament Neurofilament protein middle Z12152 1.26 4E−08 HA 23 (NF-M) Neurofilament protein; rat M25638 1.49 2E−07 HA HC 22 smallest (NF-L) Neuron glucose transporter D13962 1.4 7E−05 FA 5 Phospholipase C-beta 1 L14323 0.79 3E−05 HA 24 Phospholipase D gene 2 NM_033299 0.72 1E−04 HA HC 23 Ras homolog enriched in brain A1009605 1.42 3E−03 HA HC 12 SH3-domain GRB2-like 1 A1010110 1.26 1E−05 HA 12 Sodium-dependent S56141 0.78 1E−05 HA 5 neurotransmitter transporter, ventral midbrain Synapsin 2 M27925 1.34 4E−04 FC 6

[0197] 8.5. Confirmation of Select Gene Changes by Quantitative RT-PCR

[0198] All samples for ECS-treated groups and sham control groups used for microarray studies were analyzed for each gene (n=8-10 per group). QRT-PCR Microarray UniGene Name Ratio p value Ratio p value Frontal Cortex, Acute ECS Rn.32777 Ceruloplasmin (ferroxidase) 1.35 2.8E−02 1.81 2.0E−05 Rn.10599 Phosphoinositide 3-kinase, regulatory 1.59 5.8E−04 2.02 4.0E−03 subunit, polypeptide 1 Rn.6977 Lost on transformation 1 1.90 2.1E−04 1.58 8.0E−05 Rn.1716 Interleukin 6 receptor 2.24 3.1E−08 1.74 3.0E−05 Frontal Cortex, Chronic ECS Rn.11400 Prostaglandin D synthase 0.61 1.0E−02 0.71 0.03 Rn.7730 Prostaglandin E synthase 2.18 1.5E−02 not on U34A chip Rn.506 Synapsin 2 1.29 1.0E−03 1.34 0.00 Hippocampus, Acute ECS Rn.87787 glutamate receptor, 0.63 2.5E−06 0.60 3.0E−05 metabotropic 1 Rn.19727 Hairy and enhancer of split 1 0.71 3.3E−02 1.55 4.0E−05 (Drosophila) Rn.3812 Normalized rat heart, Bento 0.72 1.4E−05 0.65 2.0E−03 Soares Rattus sp. cDNA RHEAE07 Rn.9113 Aldehyde dehydrogenase 0.86 5.6E−03 0.44 2.0E−05 family 3, subfamily A2 Rn.3360 Growth factor receptor 1.36 3.1E−02 1.68 6.0E−06 bound protein 2 Rn.10815 Neuropilin 1.36 1.2E−04 1.67 2.0E−03 Rn.9797 Fibroblast growth factor 1.44 1.6E−03 1.51 2.0E−03 receptor 1 Rn.2045 Protein tyrosine 1.46 2.0E−03 1.51 3.0E−03 phosphatase type IVA, member 2 Rn.3546 Neuritin 1.59 4.9E−06 1.56 1.0E−08 Rn.9839 Nuclear receptor subfamily 1.76 6.8E−05 1.70 1.0E−05 4, group A, member 2 Rn.44320 Avian sarcoma virus 17 (v- 1.80 1.7E−07 1.84 9.0E−05 jun) oncogene homolog Rn.859 Ras homolog enriched in 1.83 1.0E−07 1.42 1.0E−05 brain Rn.11183 DNA-damage inducible 1.89 1.6E−08 1.78 4.0E−03 transcript 3 Rn.3723 Interferon-related 1.93 1.3E−07 2.01 3.0E−05 developmental regulator 1 D82074 Rattus sp. mRNA for BHF-1 1.97 7.0E−04 0.44 6.0E−04 Z21935 R. norvegicus protein kinase 2.28 1.6E−02 1.81 2.0E−02 rMNK2 S65091 cyclic AMP-regulated 2.52 6.7E−05 2.09 7.0E−07 phosphoprotein Rn.2398 TGFB inducible early growth 2.79 1.4E−08 3.62 1.0E−06 response Rn.40517 Activity and 3.64 2.9E−10 2.58 6.0E−07 neurotransmitter-induced early gene protein 4 (ania-4) Rn.44369 Prostaglandin-endoperoxide 4.03 1.5E−07 7.33 9.0E−08 synthase 2 Rn.25754 Tissue inhibitor of 4.71 8.7E−08 11.39 2.0E−06 metalloproteinase 1 Rn.11266 Brain derived neurotrophic 5.73 2.9E−08 3.85 2.0E−04 factor S82649 neuronal activity-regulated 7.24 2.0E−10 5.89 7.0E−09 pentraxin Rn.37500 Homer, neuronal immediate 7.42 2.5E−07 6.50 2.0E−07 early gene, 1 Hippocampus, Chronic ECS Rn.964 Insulin-like growth factor II 0.33 4.4E−02 0.44 5.0E−02 (somatomedin A) Rn.11400 Prostaglandin D synthase 0.34 3.9E−02 0.35 6.0E−03 Rn.19727 Hairy and enhancer of split 1 1.27 1.8E−02 2.09 5.0E−03 (Drosophila) M25638 Rat smallest neurofilament 1.31 2.1E−03 protein (NF-L) mRNA Rn.7044 Glypican 1 1.33 1.6E−02 Rn.78188 heme oxygenase-3 (HO-3) 1.42 2.3E−02 1.61 9.0E−03 mRNA Rn.859 Ras homolog enriched in 1.54 6.7E−04 brain Rn.9756 Protein phosphatase 1, 1.58 4.8E−06 1.62 6.0E−03 regulatory (inhibitor) subunit 1A Rn.3546 Neuritin 1.58 8.6E−05 1.44 8.0E−04 Rn.31799 Pyruvate dehydrogenase 1.72 1.3E−03 1.83 2.0E−02 phosphatase isoenzyme 1 Rn.9714 Neuropeptide Y 2.40 5.2E−06 2.14 1.0E−02 Rn.22 Thyrotropin releasing 21.42 1.8E−03 2.01 1.0E−02 hormone

[0199] 8.6. Electroconvulsive Seizure (ECS) Signature Genes PGI Data, Group II: Scores Algorithm Scores Unigene number for gene overlaps with: Groups I, II ECS Gene Name Accession Rat Human Mouse Human Human Mouse + (Literature) Algorithm (SEQ ID NO.) No. unigene Unigene Unigene tissue cells Rat tissue PGI II and III Score cyclooxygenase isoform S67722 Rn.44369 Hs.196384 Mm.3137 8 6 0 14 73.9 87.9 (COX-2) (SEQ ID NO: 1) putative potassium AF022819 Rn.15693 Hs.79351 Mm.10800 50 12 0 62 14.5 76.5 channel TWIK (SEQ ID NO: 2) Vascular endothelial AA850734 Rn.1923 Hs.73793 Mm.31540 46 6 12 64 11.9 75.9 growth factor (VEGF) (SEQ ID NO: 3) brain-derived neurotrophic AI030286 Rn.11266 Hs.56023 Mm.1442 0 0 0 0 74.8 74.8 factor (BDNF) (exon IV) (SEQ ID NO: 4) metallothionein 1 A AI102562 Rn.54397 Mm.192991 Mm.192991 0 0 6 6 62.1 68.1 (SEQ ID NO: 5) Phosphodiesterase 4B, M25350 Rn.2485 Hs.188 Mm.20181 38 0 0 38 29.8 67.8 cAMP-Specific (SEQ ID NO: 6) fos-related antigen (fra-2) AA875032 Rn.3212 NA NA 0 0 6 6 59.9 65.9 (SEQ ID NO: 7) HMG-CoA reductase X55286 Rn.9437 Hs.11899 NA 24 6 0 30 34 64 (SEQ ID NO: 8) neuropeptide Y (NPY) M15880 Rn.9714 Hs.1832 Mm.154796 10 0 0 10 54 64 (SEQ ID NO: 9) TGFB inducible early AI071299 Rn.2398 Hs.82173 Mm.4292 0 12 0 12 49 61 growth response (TIEG1) (SEQ ID NO: 10) nuclear orphan receptor U01146 Rn.88129 Hs.82120 Mm.3507 8 0 12 20 39.9 59.9 HZF-3 (NURR1) (SEQ ID NO: 11) FGF receptor-1 D12498 Rn.9797 Hs.748 Mm.3157 40 12 0 52 7.7 59.7 (SEQ ID NO: 12) Jun B X54686 NA NA NA 0 0 0 0 59.3 59.3 (SEQ ID NO: 13) Jun AI175959 Rn.44320 NA NA 0 0 0 0 58.8 58.8 (SEQ ID NO: 14) NMDAR1 glutamate U11418 Rn.9840 Hs.105 Mm.3292 16 0 0 16 42 58 receptor subunit (SEQ ID NO: 15) Tissue inhibitor of AI169327 Rn.25754 NA NA 0 0 0 0 56.9 56.9 metalloproteinase 1 (TIMP1) (SEQ ID NO: 16) Narp S82649 NA NA NA 10 0 0 10 46.4 56.4 (SEQ ID NO: 17) cytosolic phospholipase U38376 Rn.10162 Hs.211587 Mm.4186 0 6 0 6 48.6 54.6 A2, group IVA (SEQ ID NO: 18) silencer factor B (SF-B) X60769 Rn.6479 Hs.99029 Mm.4863 0 6 6 12 37.5 49.5 (SEQ ID NO: 19) Minoxidil sulfotransferase L19998 Rn.1507 Hs.142 Mm.17339 10 0 12 22 25.6 47.6 (SEQ ID NO: 20) VGF nerve growth factor M74223 Rn.9704 Hs.171014 NA 0 0 0 0 46.5 46.5 inducible (SEQ ID NO: 21) phosphatidylinositol 3- D64045 Rn.10599 Hs.6241 Mm.3058 26 0 0 26 19.5 45.5 kinase p85 alpha subunit (SEQ ID NO: 22) c-fos X06769 NA NA NA 0 0 0 0 44.9 44.9 (SEQ ID NO: 23) Catalase AA891848 NA Hs.395771 Mm.4215 0 0 0 0 44.1 44.1 (SEQ ID NO: 24) phospholipase C-beta 1 L14323 Rn.45523 Hs.41143 Mm.42083 10 0 0 10 34 44 (SEQ ID NO: 25) ceruloplasmin L33869 Rn.32777 Hs.296634 Mm.13787 10 0 0 10 33.4 43.4 (ferroxidase) (SEQ ID NO: 26) nerve growth factor- M92433 NA NA NA 0 0 0 0 42.5 42.5 induced clone C (SEQ ID NO: 27) thyrotropin-releasing M23643 NA NA NA 0 0 0 0 42.4 42.4 hormone (& precursor) (SEQ ID NO: 28) glial fibrillary acidic protein AF028784 NA NA NA 0 0 0 0 42 42 delta; glial fibrillary acidic protein alpha (SEQ ID NO: 29) HES-1 factor D13417 Rn.19727 Hs.250666 Mm.4451 8 6 0 14 26.7 40.7 (SEQ ID NO: 30) activity and AF030089 Rn.40517 Hs.21355 Mm.44752 10 0 6 16 23.8 39.8 neurotransmitter-induced early gene protein 4 (ania- 4) (SEQ ID NO: 31) Cytochrome P450 1b1 AI176856 Rn.10125 Hs.154654 Mm.4443 8 6 0 14 25.3 39.3 (SEQ ID NO: 32) arginine vasopressin M25646 Rn.9976 Hs.89648 Mm.6190 0 0 0 0 39 39 (Diabetes insipidus) (SEQ ID NO: 33) enhancer-of-split and AF009329 Rn.10784 Hs.33829 Mm.89873 0 0 6 6 32.5 38.5 hairy-related protein 1 (SHARP 1) (SEQ ID NO: 34) Glutathione S-transferase, AI138143 Rn.87212 NA NA 0 12 0 12 26.5 38.5 theta 2 (SEQ ID NO: 35) protein phosphatase 1, J05592 Rn.9756 Hs.76780 Mm.143788 8 0 6 14 22.7 36.7 regulatory (inhibitor) subunit 1A (SEQ ID NO: 36) interleukin 6 receptor M58587 Rn.1716 Hs.193400 Mm.2856 8 6 0 14 22.2 36.2 (SEQ ID NO: 37) krox20; (early growth U78102 Rn.89235 Hs.1395 Mm.1353 0 0 0 0 36.2 36.2 response 2) (SEQ ID NO: 38) light molecular weight AF031880 Rn.18568 Hs.211584 Mm.1956 0 6 0 6 30 36 neurofilament (SEQ ID NO: 39) neurofilament protein Z12152 Rn.10971 Hs.71346 Mm.142140 0 6 0 6 29 35 middle (NF-M) (SEQ ID NO: 40) S-adenosylmethionine AI008131 Rn.29949 Hs.262476 Mm.7880 10 0 0 10 24.8 34.8 decarboxylase (SEQ ID NO: 41) ets variant gene 3 AA859750 Rn.7937 NA NA 0 0 6 6 28.5 34.5 (SEQ ID NO: 42) neural receptor protein- M55291 Rn.11246 Hs.47860 Mm.3993 8 6 0 14 20.4 34.4 tyrosine kinase (TrkB) (SEQ ID NO: 43) neurofilament protein; rat M25638 Rn.18568 Hs.211584 Mm.1956 0 6 0 6 28 34 smallest (NF-L) (SEQ ID NO: 44) Vesl AB003726 Rn.37500 NA NA 0 0 0 33.9 33.9 (SEQ ID NO: 45) pyruvate dehydrogenase AF062740 Rn.31799 Hs.22265 NA 16 0 0 16 17.6 33.6 phosphatase isoenzyme 1 (SEQ ID NO: 46) EST195957 (Mad4 AA892154 Rn.3279 NA NA 0 0 6 6 27.2 33.2 homolog) (SEQ ID NO: 47) MHC class 1b RT1.S3 AI235890 Rn.40130 NA NA 0 0 0 0 32.2 32.2 (SEQ ID NO: 48) Bad: bcl-2 associated AA818072 Rn.36696 Hs.76366 Mm.4387 0 0 0 0 32 32 death agonist (SEQ ID NO: 49) Growth factor receptor AI170776 Rn.3360 Hs.381152 Mm.6900 0 0 0 0 32 32 bound protein 2 (GRB2) (SEQ ID NO: 50) Growth factor receptor AA801130 Rn.3360 Hs.381152 Mm.6900 0 0 0 0 32 32 bound protein 2 (GRB2) (SEQ ID NO: 51) Decorin AI639233 Rn.3819 Hs.433989 Mm.56769 0 0 0 0 31.5 31.5 (SEQ ID NO: 52) metabotropic glutamate M61099 Rn.87787 Hs.32945 Mm.157764 0 6 0 6 25.5 31.5 receptor (GluR1) (SEQ ID NO: 53) prostaglandin D synthase J04488 Rn.11400 Hs.430637 Mm.1008 0 0 6 6 25.5 31.5 (SEQ ID NO: 54) gamma-aminobutyric acid X51992 Rn.10368 Hs.24969 Mm.261561 0 6 0 6 25 31 A receptor, alpha 5 (SEQ ID NO: 55) aldehyde dehydrogenase M73714 Rn.9113 Hs.159608 Mm.4210 10 0 0 10 20.6 30.6 (SEQ ID NO: 56) rx00909s Rattus AI638960 Rn.16596 NA NA 0 0 12 12 17.3 29.3 norvegicus cDNA clone (SEQ ID NO: 57) activin type I receptor L19341 Rn.87899 Hs.150402 Mm.689 16 0 6 22 7.2 29.2 (SEQ ID NO: 58) growth factor (Arc) U19866 Rn.10086 NA Mm.25405 0 0 0 0 29 29 (SEQ ID NO: 59) Interferon-related AI014163 Rn.3723 Hs.7879 Mm.168 0 0 6 6 21.7 27.7 developmental regulator 1 (SEQ ID NO: 60) neuritin U88958 Rn.3546 NA NA 0 0 0 0 27.7 27.7 (SEQ ID NO: 61) myr 6 myosin heavy chain U60416 Rn.10640 NA Mm.3536 0 0 0 0 26.7 26.7 (SEQ ID NO: 62) interferon gamma receptor U68272 Rn.19927 Hs.180866 Mm.549 0 6 0 6 20.1 26.1 (Ifngr) (SEQ ID NO: 63) lost on transformation 1 U72620 Rn.6977 Hs.75825 Mm.220978 8 12 0 20 6 26 (LOT1) (SEQ ID NO: 64) Synapsin 2 M27925 Rn.506 Hs.6439 Mm.20892 10 0 0 10 16 26 (SEQ ID NO: 65) DNAJ (Hsp40) homolog, AA891542 Rn.4189 NA NA 0 0 0 0 25.7 25.7 subfamily B, member 5 (SEQ ID NO: 66) JE product (small inducibe X17053 NA NA NA 0 0 0 0 25.4 25.4 cytokine A2) (SEQ ID NO: 67) Activity and AF030088 Rn.37500 NA NA 0 0 0 0 24.9 24.9 Neurotransmitter induced early gene 3; (ania-3) (SEQ ID NO: 68) matrilysin (Matrix L24374 Rn.10282 Hs.2256 Mm.4825 8 12 0 20 4.3 24.3 metalloproteinase 7) SEQ ID NO: 69) 3CH134/CL100 PTPase S81478 NA NA NA 0 0 0 0 24.3 24.3 (SEQ ID NO: 70) Roaz U92564 Rn.9981 Hs.137168 Mm.23452 10 0 0 10 14.3 24.3 (SEQ ID NO: 71) EST194857 AA891054 Rn.4287 NA NA 0 0 0 0 24 24 (SEQ ID NO: 72) EST 189694 Normalized AA800197 Rn.3866 NA NA 0 0 6 6 17 23 rat heart (SEQ ID NO: 73) phospholipase D gene 2 NM_033299 Rn.9798 Hs.104519 Mm.121970 0 0 0 0 23 23 (SEQ ID NO: 74) tachykinin 2 (Neurokinin B M16410 Rn.9708 Hs.9730 Mm.2374 0 0 0 0 22.1 22.1 precursor (NKB)) (SEQ ID NO: 75) rx04104s (Neurd1) AI639109 Rn.43903 NA NA 0 0 0 0 21.8 21.8 (SEQ ID NO: 76) EST190175 (atrophin-1 AA800678 Rn.98495 NA NA 0 0 0 0 21.5 21.5 related protein) (SEQ ID NO: 77) solute carrier family 3, M80804 Rn.11196 Hs.239106 Mm.227176 10 0 0 10 11.4 21.4 member 1 (SEQ ID NO: 78) Neuron glucose D13962 Rn.95055 Hs.7594 Mm.3726 8 0 0 8 13 21 transporter (SEQ ID NO: 79) flavin-containing M84719 Rn.867 Hs.1424 Mm.976 10 0 0 10 10.3 20.3 monooxygenase 1 (SEQ ID NO: 80) SPA-1 like protein AI237576 Rn.10835 Hs.172180 Mm.203907 16 0 0 16 4.2 20.2 (SEQ ID NO: 81) PP1MM110 S74907 NA NA NA 0 0 0 0 19.9 19.9 (SEQ ID NO: 82) BHF-1 D82074 NA NA NA 0 0 0 0 19 19 (SEQ ID NO: 83) leucine-rich acidic nuclear D32209 Rn.10123 Hs.285013 Mm.613 8 6 0 14 5 19 protein (SEQ ID NO: 84) Nuclear receptor AI176710 Rn.62694 Hs.80561 Mm.101224 8 0 0 8 10.8 18.8 subfamily 4, group A, member 3 (NOR1) (SEQ ID NO: 85) rx01019s Rattus AI639256 Rn.40672 NA NA 0 0 6 6 12.8 18.8 norvegicus cDNA clone (SEQ ID NO: 86) Isovaleryl Coenzyme A AI102838 Rn.147 Hs.374536 Mm.6635 0 0 0 0 18.4 18.4 dehydrogenase (SEQ ID NO: 87) insulin-like growth factor II X17012 NA NA NA 0 0 0 0 17.8 17.8 (somatomedin A) (SEQ ID NO: 88) GADD153 U30186 Rn.11183 Hs.400353 Mm.7549 0 0 0 0 17 17 (SEQ ID NO: 89) UI-R-E0-bs-h-03-0-UI.s1 AA859627 Rn.25 NA NA 0 0 12 12 5 17 UI-R-E0 Rattus norvegicus cDNA clone (SEQ ID NO: 90) Calvasculin; S100 X06916 NA NA NA 0 0 0 0 16.8 16.8 calcium-binding protein A4 (SEQ ID NO: 91) UI-R-A0-bd-e-03-0-UI.s1 AA866485 Rn.3018 NA NA 0 0 6 6 10 16 UI-R-A0 Rattus norvegicus cDNA clone (SEQ ID NO: 92) cyclic AMP-regulated S65091 NA NA NA 0 0 0 0 15.8 15.8 phosphoprotein (SEQ ID NO: 93) di-N-acetylchitobiase M95768 Rn.11199 Hs.135578 Mm.45396 10 0 0 10 5.8 15.8 (SEQ ID NO: 94 EST225243 (S100 AI228548 Rn.11091 Hs.433503 Mm.24662 0 0 0 0 15.7 15.7 calcium binding protein A1) (SEQ ID NO: 95) mothers against dpp 3 U66479 Rn.10636 Hs.288261 Mm.7320 0 0 0 0 15.7 15.7 homolog (SEQ ID NO: 96) CCAAT/enhancerbinding, M65149 Rn.6975 NA NA 0 0 0 0 15.4 15.4 protein (C/EBP) delta (SEQ ID NO: 97) EST196314 (tescalcin) AA892511 Rn.14758 Hs.18791 Mm.26378 0 6 0 6 9.1 15.1 (SEQ ID NO: 98) EST220045 AI176460 NA NA NA 0 0 0 0 15 15 (SEQ ID NO: 99) Potassium voltage gated AI230211 Rn.10540 Hs.184889 Mm.56904 0 0 0 0 14.3 14.3 channel, Shal-related family member 2 (SEQ ID NO: 100) transducin-like enhancer AA875084 Rn.6875 NA NA 0 0 0 0 14 14 of split 1(Tle1) (SEQ ID NO: 101) Glypican 1 L02896 Rn.7044 Hs.2699 Mm.24193 0 0 0 0 13 13 (SEQ ID NO: 102) chemokine CX3C AF030358 Rn.4106 Hs.80420 Mm.103711 0 0 0 0 12.9 12.9 (SEQ ID NO: 103) EST197009 AA893206 Rn.3665 NA NA 0 0 0 0 12.8 12.8 (SEQ ID NO: 104) neuropilin AF016296 Rn.10815 Hs.69285 Mm.27448 0 0 0 0 12.8 12.8 (SEQ ID NO: 105) EST190350 AA800853 Rn.36357 NA NA 0 0 0 0 12.4 12.4 (SEQ ID NO: 106) EST213163 (FK506 AI103874 Rn.1464 NA NA 0 0 0 0 12 12 binding protein 3) (SEQ ID NO: 107) Protein-tyrosine AI180145 Rn.11317 Hs.155894 Mm.259235 0 6 6 12 0 12 phosphatase (Ptpn1, non- receptor type 1) (SEQ ID NO: 108) Ras homolog enriched in A1009605 NA NA NA 0 0 0 0 12 12 brain (SEQ ID NO: 109) SH3-domain GRB2-like 1 A1010110 NA NA NA 0 0 0 0 12 12 (SEQ ID NO: 110) DNA-damage-inducible L32591 Rn.10250 Hs.80409 Mm.1236 0 6 0 6 5.8 11.8 transcript 1 (SEQ ID NO: 111) Early growth response 1 AI176662 Rn.9096 Hs.326035 Mm.181959 0 0 0 0 11.8 11.8 (EGR1) (SEQ ID NO: 112) thymosin beta-10 M58404 Rn.5983 Hs.76293 NA 8 0 0 8 3.8 11.8 (SEQ ID NO: 113) Y box protein 1 AI230572 Rn.3181 Hs.74497 Mm.21054 8 0 0 8 3.6 11.6 (SEQ ID NO: 114) phosducin-like protein L15354 Rn.51153 Hs.9302 Mm.30709 0 0 0 0 11.3 11.3 (SEQ ID NO: 115) Ash-s D49847 Rn.3360 Hs.381152 Mm.6900 0 0 0 0 11.2 11.2 (SEQ ID NO: 116) connexin protein Cx26 X51615 NA NA NA 0 0 0 0 11.2 11.2 (gap junction beta-2 protein) (SEQ ID NO: 117) EST215655 (similar to AI169756 Rn.10581 NA NA 0 0 0 0 10.7 10.7 G33) (SEQ ID NO: 118) rx04422s Rattus AI639169 Rn.43195 Hs.296420 Mm.41508 0 0 6 6 4.6 10.6 norvegicus cDNA clone (SEQ ID NO: 119) protein tyrosine AJ007016 Rn.1072 NA NA 0 0 0 0 10.4 10.4 phosphatase (PtP4a2) (SEQ ID NO: 120) EST195538 AA891735 Rn.22703 NA NA 0 0 0 0 10.3 10.3 (SEQ ID NO: 121) EST 197783 Normalized AA893980 Rn.7498 NA NA 0 0 0 0 10 10 rat placenta (SEQ ID NO: 122) rx04826s Rattus AI639058 Rn.20963 NA NA 0 0 0 0 10 10 norvegicus cDNA clone (similar to Nedd4) (SEQ ID NO: 123) UI-R-E0-ci-e-02-0-UI.s1 AA874943 Rn.4040 NA NA 0 0 0 0 9.2 9.2 UI-R-E0 Rattus norvegicus cDNA clone (SEQ ID NO: 124) EST 195383 Normalized AA891580 Rn.22698 NA NA 0 0 0 0 9 9 rat kidney (SEQ ID NO: 125) Transcription factor M61725 Rn.22469 NA NA 0 0 0 0 8.8 8.8 UBF(SEQ ID NO: 126) EST203592 (chromobox AI009141 Rn.29900 NA NA 0 0 0 0 8.7 8.7 homolog 1) (SEQ ID NO: 127) Lysyl oxidase S77494 NA NA NA 0 0 0 0 8.5 8.5 (SEQ ID NO: 128) beta-tubulin T beta15 (aa X03369 Rn.37849 NA NA 0 0 0 0 8 8 1-445) (SEQ ID NO: 129) diacylglycerol kinase S49760 NA NA NA 0 0 0 0 7 7 (SEQ ID NO: 130) Small proline-rich protein AA891911 Rn.14720 NA NA 0 0 0 0 6.9 6.9 gene (cornilin alpha) (SEQ ID NO: 131) EST196165 AA892362 Rn.14752 NA NA 0 0 0 0 6.7 6.7 (SEQ ID NO: 132) EST106597 H31990 Rn.22664 NA NA 0 0 0 0 6.6 6.6 (SEQ ID NO: 133) heme oxygenase-3 AF058787 NA NA NA 0 0 0 0 6.5 6.5 (SEQ ID NO: 134) EST197625 (RAD52 AA893822 Rn.8154 NA NA 0 0 0 0 6.3 6.3 homolog) (SEQ ID NO: 135) protein kinase MNK2 Z21935 Rn.92317 NA NA 0 0 0 0 6.2 6.2 (SEQ ID NO: 136) rx00382s Rattus AI639155 Rn.96446 NA NA 0 0 0 0 5.2 5.2 norvegicus cDNA clone (SEQ ID NO: 137) SHB (Src homology 2 AA859468 Rn.226 NA NA 0 0 0 0 5.2 5.2 domain containing) adaptor protein B (SEQ ID NO: 138) Sodium-dependant S56141 NA NA NA 0 0 0 0 5 5 neurotransmitter transporter (SEQ ID NO: 139) UI-R-E0-bx-e-11-0-UI.s1 AA859690 Rn.51 NA NA 0 0 0 0 4.7 4.7 UI-R-E0 Rattus norvegicus cDNA clone (SEQ ID NO: 140) EST190198 (coagulation AA800701 Rn.97659 NA NA 0 0 0 0 4.4 4.4 factor III) (SEQ ID NO: 141) EST190228 AA800731 Rn.6626 NA NA 0 0 0 0 4.1 4.1 (SEQ ID NO: 142) EST197395 (reticulocalbin AA893592 Rn.98490 NA NA 0 0 0 0 4.1 4.1 2) (SEQ ID NO: 143) nuclear pore complex U41845 Rn.3242 Hs.367697 Mm.28379 0 0 0 0 3.9 3.9 protein Nup50 (SEQ ID NO: 144) clathrin, light polypeptide AA874955 Rn.3440 Hs.380749 Mm.36879 0 0 0 0 3.8 3.8 (LCB) (SEQ ID NO: 145) NAC-1 protein AF015911 Rn.94891 NA NA 0 0 0 0 3.8 3.8 (SEQ ID NO: 146) beta defensin-1 AF068860 Rn.31800 NA Mm.5341 0 0 0 0 3.7 3.7 (SEQ ID NO: 147) voltage-dependent X12589 NA NA NA 0 0 0 0 3.5 3.5 potassium channel; (shaker-related subfamily) (SEQ ID NO: 148) rx00967s Rattus AI639015 Rn.28835 NA NA 0 0 0 0 3.1 3.1 norvegicus cDNA clone (SEQ ID NO: 149) Beta-carotene 15,15′- AI014135 NA NA NA 0 0 0 0 2.9 2.9 dioxygenase (SEQ ID NO: 150) BRL-3A binding protein A09811 NA NA NA 0 0 0 0 2.9 2.9 (SEQ ID NO: 151) potassium voltage gated M32867 Rn.9884 Hs.1854 Mm.142718 0 0 0 0 2.7 2.7 channel, shaker related subfamily, member 4 (SEQ ID NO: 152)

[0200] 9. References Cited

[0201] Numerous references, including patents, patent applications and various publications, are cited and discussed in the description of this invention. The citation and/or discussion of such references is provided merely to clarify the description of the present invention and is not an admission that any such reference is “prior art” to the invention described here. All references cited and/or discussed in this specification (including references, e.g., to biological sequences or structures in the GenBank, PDB or other public databases) are incorporated herein by reference in their entirety and to the same extent as if each reference was individually incorporated by reference.

1 152 1 4154 DNA Rattus norvegicus misc_feature (1)..(4154) where n may be a or g or c or t/u, unknown, or other 1 cttcaggagt acgaagaccc tgcctacgaa ggaactcagc tctgtgttcc tgccagctcc 60 cccgccagct tcacttgcca ccaacgctgc cacaactgct gccaccaccg ctgccacctc 120 tgcgatgctc ttccgagctg tgctgctctg cgcttgccct ggcctcagcc atgcagcaaa 180 tccttgctgt tccaacccat gtcaaaaccg tggtgaatgt atgagcatag gatttgacca 240 atataaatgt gactgtaccc ggactggatt ctacggtgaa aactgtacta cgccgagatt 300 cctgacaaga atcaaattac cgctgaagcc caccccaaac acagtacact acatcctgac 360 ccacttcaag ggagtctgga acattgtgaa caacattccc ttccttcgaa ttcaatccat 420 gagatacgtg ttgacgtcca gatcacattt gattgacagc ccaccaactt acaatgtgca 480 ctacggttac aaaagttggg aagctttctc caacctctcc tactacacca gggcccttcc 540 tcctgtggct gatgactgcc caactcccat gggtgtgaaa ggaaataagg aacttcctga 600 ttcaaaagaa gttctggaaa aggttcttct gaggagagag ttcatcccgg atccccaagg 660 cacaaatatg atgttcgcat tctttgccca gcacttcact catcagtttt tcaagacaga 720 tcagaagcga ggacctgggt tcacccgagg actgggccat ggagtggact taaatcatgt 780 ttacggtgaa actctagaca gacaacataa acttcgactt ttccaggatg gaaaattgaa 840 atatcaggtc atcggtggag aggtgtatcc tcccacagtc aaagacactc aggtagacat 900 gatctaccct ccccacgtcc ctgagcacct gcggttcgct gtggggcagg aagtctttgg 960 tctggtgccg ggtctgatga tgtatgctac catctggctt cgggagcaca acagagtgtg 1020 tgatattctc aaacaggagc atcctgagtg ggatgacgag cgactgttcc aaaccagcag 1080 gctcatactg ataggagaga cgatcaagat agtgatcgaa gactacgtgc aacacctgag 1140 gggttaccac ttccaactca agttcgaccc agacctgctt ttcaaccagc agttccagta 1200 tcagaaccgc attgcctctg aattcaagac actctatcac tggcatccgc tgctgccgga 1260 caccttcaac attgaagacc aggagtacac tttcaaacag tttctctaca acaactccat 1320 cctccttgaa cacggacttg ctcactttgt tgagtcattc accagacaga ttgctggccg 1380 ggttgctggg ggaaggaatg ttccaatcgc tgtacaagca gtggcaaagg cctccattga 1440 ccagagcaga gagatgaaat accagtctct caatgagtac cgcaaacgct tctccctgaa 1500 accttacaca tcgtttgaag aacttacagg agagaaagaa atggctgcag agttgaaagc 1560 cctctaccat gacatcgatg ccatggaact gtatcccgcc ctgctggtgg aaaagcctcg 1620 nccagatgct atctttgggg agaccatggt agaacttgga gctccattct ccttgaaagg 1680 ccttatgggt aatcccatct gttctcctca atactggaaa cctagcacct tcggaggaga 1740 agtggggttt aggatcatca acactgcctc aattcagtct ctcatctgca ataatgtgaa 1800 agggtgtccc ttcgcctctt tcaatgtgca agacccgcag gctaccaaga cagccaccat 1860 caacgcaagt gcctcccact ccagactaga tgacattaac cctacagtac taatcaaaag 1920 gcgttcaact gagctgtaag agtctactga ccatatttat ttatttatat gaacaattta 1980 atttaattat ttaatattat acagaatgct tttttttcac ttaacatctt ctataacaga 2040 aggcaatatt cttgaacaat gttccatttg tgaagattcc tgtgttgtac ttttaaatat 2100 ggttatccga aagtgaaagg gaaaaaaaag aacactttca tttttcggca taagccagtg 2160 agaagggaaa tgaattttga tatctttata cttgaatgtc agctcatgac tagccttaat 2220 taagaacaaa tgaaaggtgt atgaatattt aaatgctgtt acaaggaggg aaatgtgaat 2280 atttaaatgc tgttacaagg agggaaatgc tgcattgttg gttatgactg tgtcatcctt 2340 actatgttag gagcaaccga tgtggaattg tttttaaatc ttgcatatct ttatctcatc 2400 aaagcaaagg ggtacaagtc cagttttaaa tgaacatgaa ggcagatacc ggcaactgtc 2460 tttttatttt ttaaaagcaa tctttgaaac aaatgatttg caatgtctaa atcgggagtt 2520 ggaatcactt tcgaaagccc ttactttctt gagctgtcaa atttgtaccc acacagatta 2580 agcagctacc ataaacacaa atctaaaact ggggaaaact attatgactg atggttaaga 2640 taccatgtca gggatctttc ttttctcagg agtagtgaaa agctactatg acaatcagac 2700 cttccttgta cgtcagattg ctggcgtagg aaggtggcgg agcccgtgat gttctgtcct 2760 aaacgatgga aaagctttaa agcttgtgtg tgagtggtag ccagcaaaac ctatcatagc 2820 aacaaaagag tccacaaaca aaataaccaa gaacaaagaa gggttcccaa gcttaaagac 2880 cgcatcgagg gttaaacttt ttggaaggga gacttttcag atcctcctgt gggtgcttgg 2940 cttgtgactt tggcaggctg gattttaaag agtttttctg ttgcacagta tgacacaaca 3000 gcccatctct caatgcaaaa ggtatcagtg ggcttcttca aaactttgaa atgtcttcca 3060 gctcacggta ccagaagtgc agtaggcccc atgcaatgtg tgagttcagc ctggatgcca 3120 gcatgatgct ctccttactc tgtttcttgt agtcattttg ctctgagaaa ctgctgattg 3180 atctgttttt gtagctgtgt tccaggctct tagtattctt tctttaacct ccattaatat 3240 tttctctact tgaagtttta cattcaggaa aaacctcagc tcaggactac tgtgtagctc 3300 cccttcggag gaagaagtta ctttagacaa aagggaaaaa aaattaaatg tatttttcat 3360 ttgtaattaa atggaagggc cctaccaaga ttctagaatt ggagggggtt ctgacaagaa 3420 agttacattc ttgtcctgaa gaattgcttt cttatttaaa aacagagtca gttagtgggt 3480 agttctgggc aatagaaata aatataaaac aataatgaca atcattctct acatctcatt 3540 atcagctgag gtactgtata ttactgaatt tactgaagat agttatgtct ttcagacatt 3600 gttgttataa actatgttta agcctactac aagtgtttct tttttgcatt atgtcagaat 3660 tgatgtacct tttttttaat gattacctcc ctgtactact gtgcgaacaa tcaaacaaaa 3720 tgatgagatt aatggtcatg gataaatttc aagaaaacta gtgtatttga ttgaaaagtt 3780 taaagttaga acttaggcca ttggaattta ctcatatagc aaactgcgta gagccaatat 3840 tgactcacct acacacgtta tacagattga cattttagac atttggaagg ccccgtaggt 3900 attttattag ttagaactta attttttgaa aaaacatatc caaagcacaa taggcattag 3960 aatttgtgca tcgagaactg attacaaata atattgatat gtatgtaaat aactgagaag 4020 tatgtcttat gaagaaatat attttattac aaaaaattat aaaacatttt caagattata 4080 tgctttaaaa gtttaagatc agaaaataat caactttaga aaaacacgtt taaaaattgt 4140 taacatcatt gatt 4154 2 1582 DNA Rattus norvegicus 2 gaaagggagc ggcggcggga ggcgcggagc gcgggcgccg ggaagatgct gcagtccctg 60 gccggcagct cgtgcgtgcg cctggtggag cggcaccgct cggcctggtg cttcggcttc 120 ctggtgctgg gctacctgct ctacctggtg ttcggcgccg tggtcttctc gtccgtggag 180 ctgccttatg aggacctgct gcgccaggag ctgcgcaagc tgaagcggcg cttcctggag 240 gagcacgagt gcctgtcgga gccccagctg gagcagttcc tgggccgcgt gctggaggcc 300 agcaattatg gagtgtcggt gctcagcaac gcctcgggga attggaattg ggacttcacc 360 tcggcgctct tcttcgccag cacggtgctc tccaccacag gctatggcca cacggtgccc 420 ttgtcagatg ggggcaaggc cttctgcatc atctactctg tcattggcat cccgttcacc 480 ctcctcttcc tgacggccgt ggtccagcgt gtcaccgtgc atgtcacccg cagaccggtc 540 ctctacttcc acatacgctg gggcttctcc aagcaggtgg tggccatcgt ccatgccgtt 600 ctgctgggat tcgtcaccgt gtcctgcttc ttcttcatcc cggccgccgt gttctccgtg 660 ctggaggatg actggaactt cctcgaatct ttttacttct gtttcatctc cctgagcacc 720 atcggcctgg gggactacgt tccaggggaa ggctacaacc agaagttccg ggagctgtac 780 aagatcggga tcacgtgtta cctgctcctg ggactcatag ccatgctggt tgtcctggaa 840 accttctgtg agctccacga gctgaagaaa ttcaggaaaa tgttctatgt gaagaaagac 900 aaggatgaag accaagttca catcatggag catgaccaac tgtccttttc ctccatcact 960 gagcaggcgg ccggcctgaa ggaggagcag aagcaaaacg agccttttgt ggcctcccag 1020 tcaccaccct atgaggatgg ctctgcaaac cattgagcat gggtcaccag accccggaca 1080 tgaggcagag cctagactgt gttcattttt acgagaaagt caaagctaag atgatgttat 1140 tttaagaaat atctactgtt aacaatattt taaaaacacg gaactttgga tcctggggga 1200 gtgggtttta atctctgggc aaatgaggtg tgcacccata actcacaggc aatgtgatca 1260 cctgacatca tgcagctgta actcacaggc gatgtcatca cctggcatcc tgcacctgta 1320 actcacaggt gatgtcatcg cctggcatcc tgcacctgta actcacaggt gatattatca 1380 cctggcatcc tgcacctgta actcacaggt gatgtcatca cctggcatcc tgcacttgta 1440 actcacaggt tgatgtcatc gcctggcatc ctgcacctgt aactccacag ttgatgtcat 1500 cacctgggca ttacacactg gggaagaaat acttgaaaca tgttactgca ccggaaatcc 1560 cattctaaaa ccaaacttac ta 1582 3 645 DNA Rattus sp. 3 atgaactttc tgctctcttg ggtgcactgg accctggctt tactgctgta cctccaccat 60 gccaagtggt cccaggctgc acccacgaca gaaggggagc agaaagccca tgaagtggtg 120 aagttcatgg acgtctacca gcgcagctat tgccgtccaa ttgagaccct ggtggacatc 180 ttccaggagt accccgatga gatagagtat atcttcaagc cgtcctgtgt gcccctaatg 240 cggtgtgcgg gctgctgcaa tgatgaagcc ctggagtgcg tgcccacgtc ggagagcaac 300 gtcactatgc agatcatgcg gatcaaacct caccaaagcc agcacatagg agagatgagc 360 ttcctgcagc atagcagatg tgaatgcaga ccaaagaaag atagaacaaa gccagaaaaa 420 aaatcagttc gaggaaaggg aaagggtcaa aaacgaaagc gcaagaaatc ccggtttaaa 480 tcctggagcg ttcactgtga gccttgttca gagcggagaa agcatttgtt tgtccaagat 540 ccgcagacgt gtaaatgttc ctgcaaaaac acagactcgc gttgcaaggc gaggcagctt 600 gagttaaacg aacgtacttg cagatgtgac aagccaaggc ggtga 645 4 367 DNA Rattus norvegicus 4 tttttttttt tttttgtctg ttttctgaaa gagggacagt ttattatcaa ttcacaatta 60 aagcagcatg caatttatta ttttttttaa ctttttgttt tattcctggc aacggcaaca 120 aaccacaaca ttatcgagga atgttatgca gacttttaaa gttgtgcgca aatgactgtt 180 tcattctggt catggatatg tccaataaat agattgtaga accactgtac tgtataaact 240 tcatttatac atgcagttca taaaattatc tttttcttaa ctgaataatt taccctgtta 300 tgtctatata caaatagata atttttgtct caatataatc tatacaacat aaatccctcg 360 tgccgaa 367 5 405 DNA Rattus sp. 5 ggcttttatt attcacatgc tcggtagaaa acggggttta gtaaactggg tggaggtgta 60 cggcaagact ctgagttggt ccggaaatta tttacacctg agggcagcag cactgttcgt 120 cacttcaggc acagcacgtg cacttgtccg aggcaccttt gcaaacacag ccctgggcac 180 atttggagca gcccacgggg cagcaggagc agcagctctt cttgcaggag gtgcatttgc 240 agtttttgca gccgcaggag ctggaccagg tgcaggagcc gccggtggag caggaccagt 300 tggggtccat tccgagatct ggtgaatctg gagcaacggg gtaagctaca agaaggcagt 360 ccctcgtgcc gaattcttgg cctctagggc caaattccct atagg 405 6 1080 DNA Rattus norvegicus 6 ctcatgaccc agataagtgg agtgaagaaa ctgatgcaca gctcaagcct gaacaacaca 60 agcatctcac gctttggagt caacacggaa aatgaggatc atctagccaa ggagctggaa 120 gacctgaaca aatggggcct taacatcttc aacgtggctg ggtactccca taatcggccc 180 ctcacatgca tcatgtacgc cattttccag gaaagagacc ttctaaagac gtttaaaatc 240 tcctccgaca ccttcgtaac ctacatgatg actttagaag accattacca ttctgatgtg 300 gcgtatcaca acagcctgca cgctgctgac gtggcccagt caacgcacgt tctcctctct 360 acgccagcac tggatgctgt cttcacagac ctggaaatcc tggctgccat ttttgcagct 420 gccatccatg atgttgatca tcctggagtc tccaatcagt ttctcatcaa tacaaattcc 480 gaacttgctt tgatgtataa tgacgaatct gtgctggaaa accatcacct cgctgtggga 540 ttcaagctcc ttcaagagga acattgcgac atctttcaga atcttaccaa gaagcaacgc 600 cagacactca ggaaaatggt gattgacatg gtgttagcaa ctgatatgtc caagcacatg 660 agcctcctgg ctgaccttaa aacgatggta gaaaccaaaa aggtgacgag ctccggtgtt 720 ctcctcctgg acaactatac tgaccggata caggttcttc gcaacatggt acattgtgca 780 gacctgagca accctaccaa gtccttggag ttgtatcggc aatggactga tcgcatcatg 840 gaggagtttt tccaacaggg agacaaagaa cgggagaggg gaatggagat tagcccaatg 900 tgtgataaac acacagcttc tgtggaaaag tcccaggttg gtttcattga ctacattgtc 960 catccattgt gggagacctg ggcagacctg gttcagcctg atgctcaaga cattttggac 1020 acactagaag ataacaggaa ctggtaccag agtatgattc cccagagccc ctctccacca 1080 7 580 DNA Rattus norvegicus 7 tttttttttt tttttttctg atcttaattc attttattct acaaaatgct actcagtgga 60 aagtaggaaa gccaacaaga caacaagaac ataaaacgag aacaaacccc gagggaaaat 120 aagttttaat atgttcttcc ctccatagca gcaagctcta aacagctttc cttagtgcaa 180 atactgtagg cttgtgtcac acacagtaca cagaacaacg caacacacac caccacagat 240 gcttctgagc agagatactc ctcaaaaatt taaaactata caaagatttt ttgagcacgt 300 ggtcctgcct ggagaattcg actagagaga ccctcctagg accatttcac cattactgta 360 aaaacgggac aaaaggtccc cagaaaggaa attagaattc cccatggagc cataaaacct 420 tgtacaactc gtttgcctcc agggtctaat agcaaatttc actgcacgtc attgacatat 480 cccaaatacg gatgcataaa gcttgagttt ctacgatata ccaaaatacg atatatatac 540 aactcccact gcaaaagaaa ccctgatacc tagtctttat 580 8 1159 DNA Rattus norvegicus misc_feature (1)..(1159) where n may be a or g or c or t/u, unknown, or other 8 ccgattaggt ccccaaaacg ggggacgtcc atggggatga acatgatttc caagggtacg 60 gagaaagcac ttctgaagct tcaagagttc tttcctgagc tgcagattct ggcggtcagt 120 ggtaactatt gcaccgacaa gaaacctgct gccataaact ggatcgaagg gagaggaaag 180 actgtggttt gtgaagctgt cattccagcc aaggtggtga gagaagtatt aaagagcact 240 acggaagcta tggttgacgt aaacattaat aagaatcttg tgggctctgc catggctggt 300 agcataggag gctacaacgc ccatgctgcc aacatcgtca ctgccatcta cattgcatgt 360 gcccaggatg cagcacagaa tgtggggagt tcaaactgta ttactttaat ggaagcaagt 420 ggtcccncnn ntgaagactt gnnnnncagc tgnnncatgc cgtctataga gatcggaacc 480 gtgggtggtg ggaccaacct tctacctcag caagcctgcc tgcagatgct aggtgttcaa 540 ggggcgtgca aagacaatcc tggagaaaat gcacggcagc ttgcccgaat tgtgtgtggc 600 actgtaatgg ctggtgagtt gtccttgatg gcagcattgg cagcaggaca tcttgtcaga 660 agtcacatgg ttcacaacag atcaaagata aatttacaag atctgcaggg aacatgcacc 720 aagaaggcag cttgagcatc ctgacatact tgaactgaaa cacgggcatt gggttctcaa 780 ggtctaacat gaaatctgtg aattaaaaat gtcagtgcag tgtcttgtgg aagatgaacg 840 tgatcagtga gcctgcttgg tttctggctc tttcagagac gtctgcggtc ctttgcacca 900 gactcctcag acgtgggaac tatggttctt tccgtgccgt attctagaaa gatctcatgt 960 ggatgtcatg gtgctctgag caccacagat gtgactgcag ctcgtttcta aaagctgcca 1020 caagctggaa gctggtgttt tgacgaaatg atggatcttg gtgatcagtg tggggctcac 1080 ctccaatggg ttaaaatgga gttttaaatg acactgtagc tgacagaact ctcgattttt 1140 atttattcag tctgggcgg 1159 9 539 DNA Rattus norvegicus 9 caagctcatt cctcgcagag gcgcccagag cagagcaccc gctgcgcaga gaccacagcc 60 cgcccgccat gatgctaggt aacaaacgaa tggggctgtg tggactgacc ctcgctctat 120 ccctgctcgt gtgtttgggc attctggctg aggggtaccc ctccaagccg gacaatccgg 180 gcgaggacgc gccagcagag gacatggcca gatactactc cgctctgcga cactacatca 240 atctcatcac cagacagaga tatggcaaga gatccagccc tgagacactg atttcagatc 300 tcttaatgag agaaagcaca gaaaatgccc ccagaacaag gcttgaagac ccttccatgt 360 ggtgatggga aatgaaactt gctctcctga cttttcctag tttcccccca catctcatct 420 catcctgtga aaccagtctg cctgtcccac ccaatgcatg ccaccaccag gctggattcc 480 gacccatttc ccttgttgtc gttgtatata tgtgtgttta aataaagtat catgcattc 539 10 465 DNA Rattus norvegicus 10 tttttttttt tttttttgaa agtttaggca ttttaatcta cacaaaaaac tgcaagcaaa 60 ctaatatcta aggtagagta aaatgcctaa actttcaaaa caaaaggcca aaaaaaagaa 120 ctttagctgc acttccaagc aattaaatta atgagagttc caatccttgg gctcccctta 180 gcaatgtaca gctgttcaac ctcaaatacc aacaacagtg gaagaaatga tagttttcta 240 tacttagcca cgtccacgga gttgactgta aagactagga ataataagca agataactct 300 aggagaagat gacacaaact cactttctag ctgcatttct gtaccgtaat ttcagcctct 360 atattctcga aaccaacgct ttaaaagaat cacactgcaa agctggtctc aagtataaat 420 ggcaaaaaca aagtacctga tgtgtatgcc tcgtgccgaa ttctt 465 11 2002 DNA Rattus norvegicus 11 ccctttaatt tcctcgaaaa ctccaatcac tcggctgaag ccatgccttg tgttcaggcg 60 cagtatgggt cctcgcctca aggagccagc cccgcttctc agagctacag ttaccactct 120 tcgggagaat acagctccga tttcttaact ccagagtttg tcaagtttag catggacctc 180 accaacactg aaattactgc caccacttct ctccccagct tcagtacctt tatggacaac 240 tacagcacag gctacgacgt caagccacct tgcttgtacc aaatgcccct gtccggacag 300 cagtcctcca ttaaggtaga agacattcag atgcacaact accagcaaca cagccacctg 360 ccccctcagt ccgaggagat gatgccacac agcgggtcgg tttactacaa gccctcttcg 420 cccccgacac ccagcacccc gggcttccag gtgcagcata gcccgatgtg ggacgatccg 480 ggctcccttc acaacttcca ccagaactac gtggccacta cgcatatgat cgagcagagg 540 aagacacctg tctcccgcct ttcactcttc tcctttaagc agtcccgccc gggcactcct 600 gtgtctagct gccagatgcg ctttgacggg cctctgcacg tccccatgaa cccggagccc 660 gcgggcagcc accacgtagt ggatgggcag accttcgccg tgcccaatcc cattcgcaag 720 ccggcatcca tgggcttccc gggcctgcag atcggccacg cgtcgcagtt gcttgacacg 780 caggtgccct cgccggcgtc ccggggctct ccctccaatg agggtctgtg cgctgtttgc 840 ggtgacaacg cggcctgtca gcattacggt gttcgcactt gtgagggctg caaaggtttc 900 tttaagcgca cggtgcaaaa aaacgcgaaa tatgtgtgtt tagcaaataa aaattgccca 960 gtggacaagc gccgccgaaa tcgttgtcag tactgtcggt ttcagaagtg cctggctgtt 1020 gggatggtta aagaagtggt tcgcacggac agtttaaaag gccggagagg tcgtctaccc 1080 tcaaaaccga agagcccaca ggatccctct cccccctcac ctccggtgag tctgatcagt 1140 gccctcgtca gagcccacgt cgactccaat ccggcaatga ccagcctgga ctattccagg 1200 ttccaggcaa accctgacta tcagatgagt ggagatgata ctcaacatat ccagcagttc 1260 tacgatctcc tgactggctc tatggagatc atcagagggt gggcagagaa gattcctggc 1320 tttgctgacc tgcccaaagc cgatcaggac ctgctttttg aatcagcttt cttagaatta 1380 tttgttctac gcttagcata caggtccaac ccagtggagg gtaaactcat cttttgcaat 1440 ggggtggtct tgcacaggtt gcaatgcgtg cgtggctttg gggaatggat tgattccatt 1500 gttgaattct cctccaactt gcagaatatg aacatcgaca tttctgcctt ctcctgcatt 1560 gctgccctgg ctatggtcac agagagacac gggctcaagg aacccaagag agtggaagag 1620 ctacaaaaca aaattgtaaa ttgtcttaaa gaccatgtga ctttcaataa tgggggattg 1680 aaccgaccca actacctgtc caaactgttg gggaagctcc cagaacttcg caccctttgc 1740 acacaggggc tccagcgcat tttctacctg aaattggaag acttggtacc accaccagca 1800 ataattgaca aacttttcct ggacacctta cctttctaag actttctccc atgcacgtca 1860 aagaactgga aagaaaaaaa aaatccagag ggggctggtc aagatgggta gagagctggc 1920 tgaagtgtcc ggttcatgtc tcccttctgt agacccctag ccctcacccc taaagtaaac 1980 aaacaaacaa gcaaacaaac gg 2002 12 2469 DNA Rattus norvegicus 12 atgtggggct ggaggggcct cctcttctgg gctgtgctgg tcacagccac tctctgcact 60 gccagaccag ccccaacctt gcccgaacaa gctcagccct ggggagtccc tgtggaagtg 120 gagtctctcc tggtccaccc tggtgacctg ctacagcttc gctgccggct gcgcgatgat 180 gtgcagagca tcaactggct gcgggatggg gtgcagctgg cggaaagcaa ccgtacacgc 240 atcacagggg aggaggtgga ggtgcgggat tccatccccg ctgactctgg cctctacgct 300 tgtgtgacca acagcccctc tggcagcgat accacctact tctccgtcaa tgtctcagat 360 gcactgccat cctcggagga cgatgacgat gatgatgact cctcctcaga ggagaaagag 420 acagacaaca ccaaaccaaa ccgtaggcct gtggcgccat actggacatc cccagagaaa 480 atggagaaga aactgcacgc agtgccagct gccaagacgg tgaaattcaa atgcccgtcg 540 agtgggacac ccagccccac tttgcgctgg ttgaaaaacg gcaaggaatt caaacctgac 600 caccggatcg gaggctacaa ggttcgttac gccacttgga gcatcataat ggactctgtg 660 gtgccttctg acaagggcaa ctacacctgc atcgtggaga acgagtatgg gagcattaac 720 cacacctacc agctagacgt tgtggagcga tcccctcacc ggcccatcct tcaggcaggg 780 ctaccagcca acaagaccgt ggccctgggc agcaacgtgg agttcatgtg caaggtgtac 840 agtgaccccc agcctcacat ccagtggctg aagcacatcg aggtgaatgg gagtaagatc 900 ggtccagaca acttgccgta tgaccagatc ctgaagactg ctggagttaa taccaccgac 960 aaggaaatgg aggtgcttca tctacggaat gtctcctttg aggatgcggg ggagtatacg 1020 tgcttggcgg gtaactctat cggactctcc catcactctg catggttgac cgttctggaa 1080 gccctggaag agagaccagc cgtgatgacc tcacctctgt acctggaaat cattatctac 1140 tgcaccgggg ccttcctgat ctcctgtatg gtgggctccg tcatcatcta caagatgaag 1200 agcggcacca agaagagcga cttccatagc cagatggctg tgcataagct ggctaagagc 1260 atccctctcc gcagacaggt aacagtgtca gctgactcca gcgcatccat gaactccggg 1320 gttctcctgg ttcggccttc gcgactgtcc tccagcggaa cccccatgct agctggcgtc 1380 tctgaatatg agctccctga agatccccgc tgggagctgc cccgggacag actggtctta 1440 ggaaaaccgc ttggcgaggg ctgcttcggg caggtggtgt tggccgaagc catcggtctg 1500 gataaggaca aacccaaccg cgtgaccaaa gtggccgtga agatgttgaa gtctgatgcg 1560 acggagaagg acctgtcgga cctgatctcg gagatggaga tgatgaaaat gatcgggaag 1620 cacaagaata tcatcaacct gctgggggcg tgcacacagg atggtcctct ctatgtcatt 1680 gtggagtatg cctccaaagg caatcttcgg gagtatctgc aggcccggag gcctcctggg 1740 ctggagtatt gctacaaccc cagccacaac cctgaggaac agctgtcttc caaagatctg 1800 gtgtcctgtg cctatcaggt ggcccggggc atggagtatc ttgcctcgaa gaagtgtata 1860 caccgagacc tggctgctag gaatgtcctg gtgacagagg ataatgtcat gaagatcgca 1920 gactttggcc tagctcgaga cattcaccat atcgactact ataagaaaac caccaatggc 1980 cggctgcctg tgaagtggat ggcacctgag gcattgtttg accggatcta cacccaccag 2040 agtgatgtgt ggtcttttgg ggtgctctta tgggagatat tcactctggg tggctcacca 2100 aaccccggcg tgcctgtgga agaacttttc aagctgttga aggagggtca tcgaatggac 2160 aagcccagta actgtaccaa tgagctgtac atgatgatgc gggactgctg gaacgcagtg 2220 ccctctcaga gaccaacttt caagcagttg gtggaagacc tggaccggat tgtggccttg 2280 acctccaacc aggagtatct ggacctgtcc atgccactgg accaggactc gccaagcttt 2340 ccggacacac ggagctctac ctgctcttca ggggaggact ctgtcttctc tcatgagcca 2400 tttcctgagg agccctgtct gccccgacac cccacccagc ttgcaaatgg cggactcaac 2460 cggcgctga 2469 13 3165 DNA Rattus norvegicus misc_feature (1)..(3165) where n may be a or g or c or t/u, unknown, or other 13 gcagaatttg gcaggcccag gctagggtgc accaaccctt aggctcagaa tgacgagaca 60 gggccgggct ctttccctcc ggctattgcc acacttcctg cctcggctct ttttccctag 120 cctgtttcta aggaagggag tggggttggg cgaccgcacc ccagctatcc cgcctcttcg 180 gccctccaaa agctgacagg atatcgggcg agcccagagt gactaagggg aggctttgga 240 ctcaggtaca gggtatgtca gtgcctagag accacttacg aggtacagtc tcattcttac 300 aagccccttg tctctgggat cctccaagtc gtcctctcgg gtagatctca gataccctcc 360 tggcatcgcc tttacagctc agaaggaagc catgtcctga ccattttaac tttccatcag 420 agttctgcaa ataatcacag tgaccccaat ctttgctaga tatgttccca tcttctactc 480 ctattgcctc agcgtttcca ggacccttgt tcactttctg catatctaaa ttgacctccc 540 caaaattact ccccttttcc acttatggga gaaccctcag actcatctgt gggctttaga 600 gcccctactc atttctatac aataggtacc tgaccctgct tcctgaacct tctctcccac 660 ctgttcttaa gcacttgcac tcggggcctc tagttaattc ctaggtcaaa tgttaacaat 720 tgctcccgtt ttgtaccttc cctgttttct ggggtcttct agtatattgt cagttcacaa 780 gtcacagcta ttgtgggtcg ccgcgagact attccgggaa tatcctgagg caccccatct 840 accttgcagt gtacagagat ctcatagcca ccgggttgta ctcccgcagt cctaactcct 900 atgtgcctcg gtccctcaac atccctttgc ttgagtgtct ctgtctctac agcccctccc 960 cctgcagccg cgcagagcca ccgggctgct ggccgctgtt tacaaggaca cgcgcttcct 1020 gacagtgacg cgagccgcct cctccccttc cccacgctct aggagggggc cgcgggggcc 1080 tggctcccgc gtcggccaat cggagtgcac ttccgcagct gacaaattca gtataaaatg 1140 cttggggctg gggccgaaca ctggggacct tgagggtggc caggccagct ttggatcctg 1200 cagggagcgg ggagctgaga gaagagacgc tgagaaagcg ggcgcaccac ggagggagag 1260 aaaagctcca gaagccgggc agcgctttta cgcacagcta ccaactggcc gctgccgacc 1320 gtctccagct cccgaggacg cgcgaccgga cgccgggtcc cgccacagcc gaggacagct 1380 cgccggtcgc cgcaggcagg cccggagcgg ccttcagggg gacctttccc agatcgccca 1440 ggccgcccgg atgtgcacga aaatggaaca ggctttctat cacgacgact cttacgcagc 1500 ggcaggatac ggtcggagcc ctggcagtct ttctcttcac gactacaaac tcctgaaacc 1560 caccttagcg ctcaacctgg cagatcctta tcggggtctc aagggtcctg gggcgcgggg 1620 tccaggccca gagggcagtg gggcaggcag ctacttttcg ggtcagggat cagacacagg 1680 cgcatctctg aagctagcct ccacggaact ggagcgcttg atcgtcccca acagcaacgg 1740 cgtgatcacg acgacgccca cgcctccggg acagtacttt tacccccgtg ggggcggcag 1800 cggcggaggt acagggggcg gcgtcaccga ggagcaggag ggctttgcgg acggttttgt 1860 caaagccctg gacgacctgc agaagatgaa ccacgtgacg ccccccaacg tgtctctggg 1920 cgccagcggg ggtccccagg ccgggccagg gggcgtctat gctggtccgg agccgcctcc 1980 ggtctacacc aacctcagca gttactcccc agcctctgca ccctctggag gttccgggac 2040 cgccgtcggg actgggagct catacccgac ggccaccatc agctacctcc cacatgcacc 2100 accctttgcg ggcggccacc cggcacagct gggcttgagc cgtggcgctt ccgcctttaa 2160 agaggaaccg cagaccgtac cggaggcacg cagccgcgac gccacgccgc ctgtgtcccc 2220 catcaacatg gaagaccagg agcgcatcaa agtggagcga aagcggctgc ggaacaggct 2280 ggcggccacc aaatgccgga agcggaagct ggagcgcatc gcgcgcctgg aggacaaggt 2340 gaagacactc aaggctgaga acgcggggct gtcaagtgct gccggcctcc tacgggagca 2400 agtggcgcag ctcaagcaga aggtcatgac ccacgtcagc aacggctgcc agttgctgct 2460 aggggtcaag ggacacgcct tctgagagcc tcccttgctc catacggaca cccccagcct 2520 tgaaggctgg gcgcccgccc cccactgggg tgaggggggc aggcgatggg cactcgccca 2580 gaggtctggg gcgcagctca cacactggac tctggcctgc ccgcctgcgc ccagtccttc 2640 cacctcgagg tttacatggc ccccttccag cgtattttgt atgttttttt tttctggaaa 2700 gagactgaat tcatattgaa tataatatat ttgtgtattt aacagggagg ggagaagggg 2760 gttgtcgcgg cggagtggcc ccgccgcctg gtactcagcc tgtggggata ctagggagga 2820 acctccgccc cctgccctcc ccctctgcac agtactgtgg agaagaaaca cgcacttcgt 2880 gtctaaagtc tattttaaga tgtgtttgtg tgtgtgtttg actttttatt gaatctattt 2940 aagtaaaaaa aaagtcttta ttaatttctg tggtctcttt cttccaagct gnncgnatgg 3000 agggagaaga ttgggctgnn ccaagcccgg ggcagtttgt agttctctcc tttcgggtat 3060 cttaaggtcc agtacaagcg ctcaaacctc ccatcccctg aggtcctggc accagagctg 3120 cgcaggcagg cagtggccga gttacaagct atccaggccg aattc 3165 14 421 DNA Rattus sp. 14 atctcttatt tacaaacact gggtaggaca cccaaacaaa caaacatgga ataacttaca 60 aaggcaggaa gctgtttatt atagacagta atcagctttc atcaaattaa aaaaaaatat 120 atgtacatac acagttgaga gaggcaggcc aggggagttc atccgcaatc tagcctggta 180 ctcacaagcc tccctcccct tcccagccct ccctgctttg tgttcttacg gagcactaca 240 gaagcaatct acagtctcta ttgcagtttg taagccccca cccacccccc ttaatactga 300 atgagatcga atgttaggtc catgcagttc ttggtcaatg ttaacgaaaa gtccatcgtt 360 ctggtcgcgc gggcacagcc cgttcgcaaa gcgtggcggt caacaggccg ctgctctctg 420 g 421 15 2957 DNA Rattus norvegicus 15 ctcggacagc atccgccgcg ctcgcccggg gctcctagag aacccggggg cgcttgaccg 60 cgcgcggcgg cccggcgtcg tacatcgcga ggtcgtcgca ctcgcgcaac ccagagccag 120 gcccgctgtg cccggagctc atgagcacca tgcacctgct gacattcgcc ctgctttttt 180 cctgctcctt cgcccgcgcc gcctgcgacc ccaagatcgt caacatcggc gcggtgctga 240 gcacgcgcaa gcatgaacag atgttccgcg aggcagtaaa ccaggccaat aagcgacacg 300 gctcttggaa gatacagctc aacgccactt ctgtcaccca caagcccaac gccatacaga 360 tggccctgtc agtgtgtgag gacctcatct ctagccaggt ctacgctatc ctagttagcc 420 acccgcctac tcccaacgac cacttcactc ccacccctgt ctcctacaca gctggcttct 480 acagaatccc tgtcctggga ctgactaccc gaatgtccat ctactctgac aagagtatcc 540 acctgagttt ccttcgcacg gtgccgccct actcccacca gtccagcgtc tggtttgaga 600 tgatgcgagt ctacaactgg aaccacatca tcctgctggt cagcgacgac cacgagggac 660 gggcagcgca gaagcgcttg gagacgttgc tggaggaacg ggagtccaag gcagagaagg 720 tgctgcagtt tgacccagga accaagaatg tgacggctct gctgatggag gcccgggaac 780 tggaggcccg ggtcatcatc ctttctgcaa gcgaggacga cgctgccaca gtgtaccgcg 840 cagccgcaat gctgaacatg acgggctctg ggtacgtgtg gctggtcggg gaacgcgaga 900 tctctgggaa cgccctgcgc tacgctcctg atggcatcat cggacttcag ctcatcaatg 960 gcaagaatga gtcagcccac atcagtgacg ccgtgggcgt ggtggcacag gcagttcacg 1020 aactcctaga gaaggagaat atcactgacc caccgcgggg ttgcgtgggc aacaccaaca 1080 tctggaagac aggaccattg ttcaagaggg tgctgatgtc ttctaagtat gcggacggag 1140 tgactggccg tgtggaattc aatgaggatg gggaccggaa gtttgccaac tatagtatca 1200 tgaacctgca gaaccgcaag ctggtgcaag tgggcatcta caatggtacc catgtcatcc 1260 caaatgacag gaagatcatc tggccaggag gagagacaga gaaacctcga ggataccaga 1320 tgtccaccag actaaagata gtgacaatcc accaagagcc cttcgtgtac gtcaagccca 1380 caatgagtga tgggacatgc aaagaggagt tcacagtcaa tggtgaccca gtgaagaagg 1440 tcatctgtac ggggcctaat gacacgttcc caggcagccc acgccacaca gtgccccagt 1500 gctgctatgg cttctgcata gacctgctca tcaagctggc gcggaccatg aattttacct 1560 atgaggtgca cctggtggca gatggcaagt ttggcacaca ggagcgggta aacaacagca 1620 acaaaaagga gtggaacgga atgatgggcg agctactcag tggccaagcg gacatgattg 1680 tggcaccact gaccatcaac aatgagcgtg cgcagtacat agagttctcc aagcccttca 1740 agtaccaggg cctgaccatt ttggtcaaga aggagattcc caggagcaca ctggactcat 1800 ttatgcagcc ttttcagagc acactgtggt tgctagtagg actgtcagtt catgtggtgg 1860 ctgtgatgct gtacctgctg gaccgcttca gtccctttgg ccgattcaag gtgaacagtg 1920 aggaggagga ggaagatgca ctgaccctgt cctctgccat gtggttttcc tggggcgtcc 1980 tgctcaactc cggcattggg gaaggtgccc cccggagttt ctctgcacgt atcctaggca 2040 tggtgtgggc tggtttcgcc atgatcatag tggcttccta cactgccaac ttggcagctt 2100 tcctggtgct ggatcggcct gaggagcgca tcacgggcat caatgacccc aggctcagaa 2160 acccctcaga caagttcatc tacgcaactg taaagcagag ctccgtggac atctacttcc 2220 ggaggcaggt ggagttgagt accatgtacc ggcacatgga aaaacacaat tacgagagcg 2280 cagctgaggc catccaggct gtgcgggaca acaagctgca cgcctttatc tgggactcgg 2340 ccgtgctgga gtttgaggct tcacagaagt gcgatctggt gaccacgggt gagctgttct 2400 tccgctcagg ctttggcatc ggcatgcgca aggacagccc ctggaagcag aacgtttccc 2460 tgtccatact caagtcccat gagaatggct tcatggaaga tctggataag acatgggttc 2520 ggtatcagga atgcgactcc cgcagcaatg ctcctgcaac cctcactttt gagaacatgg 2580 caggggtctt catgctggtg gctggaggca tcgtagctgg gattttcctc attttcattg 2640 agatcgccta caagcgacac aaggatgccc gtaggaagca gatgcagctg gcttttgcag 2700 ccgtgaacgt gtggaggaag aacctgcagg atagaaagag tggtagagca gagcccgacc 2760 ctaaaaagaa agccacattt agggctatca cctccaccct ggcctccagc ttcaagagac 2820 gtaggtcctc caaagacacg agcaccgggg gtggacgcgg cgctttgcaa aaccaaaaag 2880 acacagtgct gccgcgacgc gctattgaga gggaggaggg ccagctgcag ctgtgttccc 2940 gtcataggga gagctga 2957 16 644 DNA Rattus sp. 16 gctgtgtgat agttctttat ttcaccattt aagagaaaga aagatggagg aaaggtaaac 60 agtgttcagg cttcagcttt tgccagggga aggcttcggg tcatcgagac cccaaggtat 120 tgccaggtgc acaaatctgg attccgtggc aggcaggcaa agtgatcgct ctggtagccc 180 ttctcagagc ccatgaggat ctgatctgtc cacaagcaat gactgtcact ctccagtttg 240 caagggatgg ctgaacaggg aaacactgtg cacaccccac agccagcact ataggtcttt 300 acgaaggcct tttgctgagc agggctcaga ttatgccagg gaaccaggaa gctgcaggca 360 gtgatgtgca aatttccgtt ccttaaacgg cccgcgatga gaaactcctc gctgcggttc 420 tgggacttgt ggacatatcc acagaggctc tccatggctg gggtgtaggc gaaccggaaa 480 cctgtggcat ttcccacagc gtcgaatcct ttgagcatct tagtcatctt gatctcataa 540 cgctggtata aggtggtctc gatgatttct ggggaaccca tgaatttagc ccttataacc 600 aggtccgagt tgcagaaagc tgtctgtggg tgggttgggg caca 644 17 2562 DNA Rattus sp. 17 tggtgctggc gtttccctgc ttgcacgcgg ttccctcgag cgccgctccg accgacgtag 60 ccggccgcga aggcgcccag acggcaagcc agcgacccat gctgaagtga gcgcccaggt 120 cagcgagatg ctggcgctgc tgaccgccgg cgtggcgctc gccgtggccg cgggacaagc 180 ccaggataac ccgatacctg gcagtcgctt cgtgtgcacc gcgctgcccc ccgaagcggc 240 gcgcgccggc tgcccgctgc ccgcgatgcc catgcaggga ggcgcgctga gccctgagga 300 ggagctgcga gccgctgtgc tgcactggcg cgagaccgtc gtgcagcaga aggagacgct 360 gggcgctcag cgagaagcca tccgagaact caccagcaag ctggcccgct gtgagggact 420 agccggcggt aaggcgcgcg gcacgggggc cacgggcaag gacaccatgg gcgacctgcc 480 gcgggacccg ggccacgtcg tggagcagct tagccgctcg ctgcagaccc tcaaggaccg 540 cttggagagc ctcgagctcc aactccacac caacgcgtct aatgccgggc tgccgagcga 600 cttccgagag gtgctccagc ggaggctggg ggagctggag aggcagttgc tacgcaaggt 660 ggccgagctg gaagacgaga agtccctgct ccacaatgag acctcggctc accggcagaa 720 gacagagaac acactgaatg cactgctgca gagggtgact gagctggaga gaggcaacag 780 tgcattcaag tcaccagatg cattcaaagt gtccctccct ctccgtacaa actacctata 840 cggcaagatc aagaagacgt tgcccgagct gtatgccttc accatctgcc tgtggctgcg 900 gtccagcgcc tcgccaggca tcggcacgcc attctcctac gctgtgcctg ggcaagccaa 960 tgagattgtg ctgatagagt ggggtaacaa tcccatagag ctgcttatca acgacaaggt 1020 cgcacagctg cccctgtttg tcagcgatgg caagtggcac catatctgca tcacctggac 1080 cactcgagac ggcatgtggg aagcattcca ggacggggag aagctgggca ccggggagaa 1140 cctggcaccc tggcatccca tcaagccagg gggtgtgctc atcctggggc aggagcagga 1200 cactgtggga ggcagatttg atgccacaca ggccttcgtt ggagagctta gccagttcaa 1260 catatgggac cgtgtcctcc gggcacaaga gatcatcaac atcgccaact gctccacgaa 1320 catgcctgga aacatcatcc catgggtgga caacaatgtc gatgtgtttg gaggggcttc 1380 caagtggcct gtggagacgt gcgaagagcg tctcctggac ttgtagctac cttctccctg 1440 tcccagaggc caagagcggg ctgttctggg gagttcaagg catctattcc cgagttcaac 1500 taaaatctct ggcctgagta ggaaagaacc agagccccta aggcaggctg tgtggcctcc 1560 tttgtcttag gctcctatgt tcttactgct ttgttctttg gtgggaagtg accgaagccc 1620 tgggaagagt cctgagccac ttcctgctgg ggtttctagt aaagtctgtg agcctctcca 1680 cccctcctgt aaatgctagt gcaacccagc cctgcctgtc attttggatc cttagtgtct 1740 cgtgtgtgct tcccgtctgt cccctttgat ggctgtgtgg tcatcctacc ggggtggcct 1800 gggtcccttg tgtgtgtagc acatccctgc ttttgactga acacagtgca cagaagctac 1860 ccgcccctga aacagggtct ctccctcagt gtcatgtgca ctctggtctc tccctctgag 1920 gggactgcag ctgctggagg gccacgtgcc cagacagtcc ccagcatccc caaagcagac 1980 cctccgccat ggagaaagtc ccccacagct tccccaccct ctgtccacct ctcagacccc 2040 acgcttctaa ggaccattgc tgggttggct ttcaaaagct gctgctctca tctggtgcca 2100 aaagttcatt tgcagcttct acaccgttct gtgtggtttg gggattgact ttattccccc 2160 acaaaagagg aacagccatt agaagccagc ctcccctcct tttgatgctc agcccactgt 2220 gaagagtgag cttgcttgta agccacattg gtttctgtga gcatctgact ctcccccgtc 2280 cagtattttc cccggaactg gagattcgag tgccattcgg ctgctacctg cttagtgact 2340 ccaggctgca tcatgtatca taatttattt taaagacaaa gtgattcagt ggggaaattt 2400 ataaagctat aaatattata tattttattt ttcatacatg tttaaagtgc ggatccatgg 2460 atgttccatt tgtaggacca gcttgacgtg cccatcctga cattgtatgc cacaagagct 2520 cttgtgatga tggaattttg attaaagtgc actggaagat ga 2562 18 2858 DNA Rattus norvegicus 18 gaattcggca cgagcggaac tgtgaagggc tcccatcagg ctcccgactg acagagagct 60 agaaggcaca gagaaacctg aggattctca tttaactctg ggaactgctt caagaagcta 120 tagtaccaga gaacacctgg gaagtgtgag aattcctgca gctgggacca aaatgtcttt 180 catagatcct tatcagcaca taatagtgga acaccagtat tcccataagt ttacagtagt 240 ggttctacgt gccaccaaag taaccaaggg gacctttggc gatatgctgg acactcctga 300 tccttatgtg gaacttttca tctctacaac ccctgacagc aggaagcgaa caagacactt 360 caataatgat ataaaccctg tgtggaatga gacctttgag ttcattttgg atcctaatca 420 ggaaaatgtt ttggagatca cattgatgga tgccaattac gtcatggatg aaaccctagg 480 cacagctaca tttcctgtat cttctatgaa ggtgggagag aagaaagaag tcccttttat 540 tttcaaccaa gtcacagaaa tgattctgga aatgtctctt gaagtttcgt catgcccaga 600 cctacggttc agcatggcac tgtgtgatca ggagaaaaca ttcaggcagc agaggaaaga 660 gaacataaaa gagaacatga agaaactttt gggtccaaaa aagagcgagg ggctttattc 720 cacacgtgat gtgcctgtgg tggccatttt gggttccggc gggggtttcc gggccatggt 780 gggattctcc ggtgtgatga aggcgctcta tgaatcaggg attttggatt gtgcgaccta 840 cgttgctggt ctgtccggct ccacatggta catgtcaacc ttgtactccc accctgattt 900 tccagagaaa ggtcctgagg agattaatga agagctaatg aaaaatgtta gccacaaccc 960 tctcttactt cttacgccac agaaagttaa aagatatgtt gagtctttat ggaagaagaa 1020 aagttctggc cttcctgtca cctttactga catctttgga atgttaatag gagaaacact 1080 aattcaaaat agaatagtac cgaccttgag tagcttgaag gaaaaggtca gcgccgcccg 1140 gtgtcctctg cctctcttca cctgtctcca tgtcaaaccg gacgtgtcag agctgatgtt 1200 tgccgattgg gtagaattta gtccatacga aattggcatg gcaaaatatg gtacctttat 1260 gactcctgac ctgtttggaa gcaaattttt tatgggaaca gttgtaaaaa aatatgaaga 1320 aaaccccttg catttcttaa tgggtgtctg gggcagtgcc ttttctatac tgttcaacag 1380 agttttggga gtttctggct tacagaataa aggttctaca atggaggagg aattagaaaa 1440 tattacagca aagcacattg tgagtaacga cagctctgac agcgatgacg aggcccaagg 1500 acccaaaggc accgagaatg aagatgcgga aagagagtac caaaatgaca accaagcaag 1560 ttgggtccat cggatgctaa tggccttggt gagtgactca gctttattca atacccgaga 1620 aggacgtgct gggaaggtgc ataacttcat gttgggcttg aatctcaaca catcgtatcc 1680 actgtctccc ctgagagact tcagccccca agattccttc gatgatgatg aactcgacgc 1740 agcggtagca gatccagatg aatttgaacg aatatatgaa ccactggatg tcaaaagtaa 1800 aaagattcat gttgtagaca gtgggctcac gtttaacctg ccgtatccct tgattctgcg 1860 acctcagaga ggtgtggatc tcatcatttc ctttgacttt tctgcaaggc caagtgacac 1920 cagccctcca ttcaaggaac ttctgcttgc agagaagtgg gctaaaatga acaagctccc 1980 ttttccaaag attgatcctt acgtgtttga tcgggaagga ttgaaggaat gctatgtgtt 2040 taaacctaag aatcctgatg tggagaagga ttgcacaacc attatccact ttgttctggc 2100 caacatcaac ttcagaaagt acaaggcccc aggtgttctg agggaaacca aggaagagaa 2160 agaaatagct gactttgaca ttttcgatga ccccgaatcg ccattttcaa ccttcaactt 2220 ccagtatcca aatcaagcat tcaaaaggct acatgatctg atgtacttca acacactgaa 2280 caacattgat gtgataaagg atgccattgt tgagagcatt gaatacagaa gacagaaccc 2340 atctcgttgc tctgtttccc tcagtaatgt tgaggcaaga aaattcttca acaaggagtt 2400 cctaagtaaa cccacagcag agtccatttg aattccatga ctactggagt tcagagcaca 2460 tgagagatca tcttactatg cacaagagac tgactgctac tcagagttgt ggggacggag 2520 gcgtgtgtta ggtgaaaacg gtgttgatta tgcaatactt ggcaacagtt tctgacagta 2580 tgaatttttt gacataagca tagggctata tactgtattt taaacattcc tcacattttt 2640 acctgagcat ttttatatat atataaaaat atcctttcct tttataaata ttaatagttt 2700 aactcagtaa aaaaaagctt cccattgtgt gtgaatgtta ttctgaacta gatttgttca 2760 tgccatgtta caacactatt tttatttaaa tgttcatatc tacacatgcg aaataaatac 2820 tttgatatac aaattgccaa aaaaaaaaaa aaaaaaaa 2858 19 1194 DNA Rattus norvegicus 19 ttcgccgcgc ccgcgcccgc gcaccacgac ttcctttccg acctcttcgc cgacgactac 60 ggcgccaagc cgagcaagaa gccgtccgac tacggttacg tgagcctcgg ccgcgcgggc 120 gccaaggccg caccgcccgc ctgcttcccg ccgccgcctc ccgccgcact caaggccgag 180 ccgggcttcg aacccgcgga ctgcaagcgc gcggacgacg cgcccgccat ggcggccggc 240 ttcccgttcg ccctgcgcgc ctacctgggc taccaggcga cgccgagcgg cagcagcggc 300 agcctgtcca cgtcgtcgtc gtccagcccg cccgggacgc cgagccccgc cgacgccaag 360 gccgcgcccg ccgcctgctt cgcggggccg ccggccgcgc ccgccaaggc caaggccaag 420 aaggcggtgg acaagctgag cgacgagtac aagatgcggc gcgagcgcaa caacatcgcg 480 gtgcgcaaga gccgcgacaa ggccaagatg cgcaacctgg agacgcagca caaggtgctg 540 gagctgacgg cggagaacga gcggctgcag aagaaggtgg agcagctgtc gcgagagctc 600 agcacgctgc ggaacttgtt caagcagctg cccgagccgc tgctggcctc ggcgggtcac 660 tgctagcccg gcgggggtgg cgtgggggcg ccgcggccac cctgggcacc gtgcgccctg 720 ccccgcgcgc tccgtccccg cgcgcgcccg gggcaccgtg cgtgcaccgc gcgcacctgc 780 acctgcaccg aggggacacc gtgggcaccg cgcgcacgca cctgcaccgc gcaccgggtt 840 tcgggacttg atgcaatccg gatcaaacgt ggctgagcgc gtgtggacac gggactgacg 900 caacacacgt gtaactgtca gccgggccct gagtaatcac ttaaagatgt tcctgcgggg 960 ttgttgctgt tgatgttttt gtttttgttt tttgtttttt gttttttttt tggtcttatt 1020 atttttttgt attatataaa aaagttctat ttctatgaga aaagaggcgt atgtatattt 1080 tgagaacctt ttccgtttcg agcattaaag tgaagacatt ttaataaact tttttggaga 1140 atgtttaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaa 1194 20 1227 DNA Rattus norvegicus 20 ggaattcgca acgcctacac aaagatccct atcactgagc acccggaggc aaggctcaga 60 accccaggat cagcaacatg gagttctccc gtccaccgct agtgcatgtg aagggtatcc 120 cactcatcaa atactttgca gagacaattg ggccattgca gaacttcaca gcctggcctg 180 atgacttgct gatcagcaca tacccaaagt ctggtactac ctggatgagt gagatcctgg 240 atatgatcta tcagggtggc aagctagaga agtgtggccg cgcccccatc tatgcccggg 300 tacccttcct tgagttcaaa tgtccagggg ttccctcagg tcttgaaact ttggaagaga 360 caccagcccc acggctcctt aagacacatc tgcccctgtc cttgctccct cagagtctgc 420 tggatcagaa ggtcaaggtg atctacattg cccgaaatgc aaaggatgtg gttgtctcct 480 attataactt ctacaacatg gccaagctgc accctgatcc aggcacctgg gacagcttct 540 tggagaactt catggatggg gaagtgtcct atgggtcgtg gtaccagcac gtgaaggagt 600 ggtgggagct gagacacact caccctgttc tctatctctt ctatgaagac ataaaggaga 660 accccaaaag ggagatcaag aagattctag agtttttggg gcgctctcta cccgaggaga 720 ctgtggattc cattgttcac cacacatctt tcaagaaaat gaaagagaac tgcatgacta 780 actacacaac catccccact gagattatgg accacaatgt ttctcccttc atgaggaaag 840 gtactactgg ggactggaaa aataccttca ctgtagccca gaatgagcgc tttgatgccc 900 actatgctaa gacaatgaca gattgtgact tcaagtttcg ttgtgaacta tgagtggatt 960 atggctatac tgggaaccaa ggcaaactga cacagcccat catgatctca agtaaaatgt 1020 gatgtgttca atctacttgt tgtatgccta gaggaaatct gagctaagag aataggattg 1080 gggatgtggc tgaggcagag ggttttatca acgcatgtca ggaaagcaat cagtcccaac 1140 acctaaaaag aacctaaagt acaaacatgc aaaaaatagt aagataaact atattttacc 1200 tgaaagaata aatgccactg ggaaatg 1227 21 2507 DNA Rattus norvegicus 21 gagcgagagc gctgttgctg acccagctga gcccagctcc taggacgcca gccctcgacc 60 atctttcata ctccagccac ggaacggagc cagggcagac gggtccggat tttccccctg 120 ccccgaccct cctctccacc tcccgccgtc gtgacaccgg ctgtctctgg cagcccgttg 180 gtcatgaaaa ccttcacgtt gccagcatcc gtcctcttct gcttccttct actcatccgg 240 gggttgggag cagcaccccc cgggcgctcc gatgtttatc ctcctcccct cggctctgag 300 cataatgggc aggtagctga ggacgcagtg tcccggccaa aggatgacag cgtcccagag 360 gtccgagcgg ctcggaattc cgagcctcag gaccagggag agctcttcca gggcgtggat 420 ccccgggcgc tggccgcggt actgttgcag gcactggacc gtccggcctc gcccccggct 480 gtcccggcag gttcccagca gggaacaccc gaagaagcag cagaagctct gctgaccgag 540 tccgtgcgca gtcagaccca tagcctcccg gcatcagaaa tccaagcgtc cgctgtggcg 600 ccccctcgcc ctcagactca ggacaacgat cccgaggcag acgaccgctc agaagagctg 660 gaggcactag catccttgct ccaagaactt cgagatttca gtccgagtaa tgctaagcgc 720 cagcaagaga cggcggcagc agagactgaa acccgcacgc acacgctgac ccgagtcaat 780 ctggagagcc ccgggccaga gcgcgtatgg cgcgcttcct ggggagagtt ccaggcgcgc 840 gtcccggagc gtgctcctct gccgccctcg gtcccttctc aattccaggc tcgaatgtcc 900 gaaaacgttc cccttcccga aacccatcag ttcggggaag gagtgtcctc ccctaaaaca 960 catcttggtg agactttgac acccttatcc aaggcgtacc aaagtctaag tgcccccttc 1020 cccaaggtgc gtcggctcga gggctcattc ctgggcggtt ccgaggcagg agagcgcctg 1080 cttcaacaag ggttagctca ggtagaggca gggaggaggc aggcggaggc cacccggcag 1140 gccgcagcgc aagaagagcg gctggccgat ctcgcctccg acctgctgct ccagtatttg 1200 ctgcagggcg gcgcccggca gcgcgatctc gggggtcgcg ggctgcagga gacgcagcaa 1260 gagcgggaga acgagaggga ggaggaggcg gagcaggaga gacgcggtgg tggggaggac 1320 gaggtggggg aagaggatga ggaggcggca gaggcggagg cggaggcaga ggaggcggag 1380 agggcgcggc agaacgcgct cctgttcgcc gaggaggagg acggggaagc cggagccgag 1440 gacaagcgct cccaggagga ggcgccaggc catcggcgga aggatgctga ggggacagag 1500 gagggcgggg aggaggatga cgacgacgaa gagatggatc cgcagacgat cgatagtctc 1560 attgaactgt ccaccaaact ccacctgcca gcagacgatg tggtcagcat catcgaagag 1620 gtggaggaga aacggaagcg gaagaagaac gcccctcccg agccggtgcc accccccagg 1680 gctgccccag ccccgaccca tgtccgctcc ccgcagcccc cacctcccgc cccggcccgg 1740 gatgagttgc cggactggaa cgaagtactc ccaccctggg atcgggagga ggatgaggtg 1800 tttcccccgg ggccctatca ccccttccca aactacattc ggccgcggac actgcagccg 1860 cccgcatcct cccgccgccg tcacttccat cacgcgttgc cacctgcgcg ccaccatccc 1920 gatctggagg cccaggccag gcgcgcgcag gaggaagcgg acgcggagga gcgccggctg 1980 caggagcagg aggagctgga gaattacatt gagcacgtgc tgctgcaccg cccgtgaccc 2040 gcccctgcgc gcccgctccc aactgcgcgc gccgccacgc ccccctccgt gtcgctcctc 2100 ctccctctcg gtgtttgcat gcgccccggc tccgcccctc ggctgccgcc cggccccgcc 2160 ccacaaggcc ccgccccggg ttctgtcagg accagacctg tcagacttct ttggggtctg 2220 atcctggggc cagcccaggc gggtgtgtgg tttgtgcgag tccccttaca cccccacttc 2280 ctccaggggc ctcgtcccca tctagtttct ctagcgactt cctggtccca aacggggaaa 2340 agctgttcta tttaatcgtg tgaagtgtct gtctcccagc cttggggccc ccggagcctc 2400 ccttctccaa attgctgtga acttacccac atcttgccct tctgttgtaa atacccctca 2460 cggaggaaat agttttgcta agaaataaaa gtgactattt tattagg 2507 22 2175 DNA Rattus norvegicus misc_feature (1)..(2175) where n may be a or g or c or t/u, unknown, or other 22 atgagcgcag aggggtacca gtacagagcg ctgtatgact acaaaaagga acgagaggaa 60 gacatcgacc tacacttggg ggacatcctg actgtgaata aaggctcctt agtggcactt 120 ggattcagtg atggccagga agcccggcca gaagatattg gctggttaaa tggttacaat 180 gaaactactg gggagagggg agactttcca ggaacttatg ttgagtacat tggaaggaaa 240 agaatttcac cccccactcc taagccacgg ccccctcgac ctcttcctgt agcacccggt 300 tcttcaaaaa ctgaagcaga cactgagcaa ccagtgttga cccttcctga tctggccgag 360 cagtttgccc ctcctgatgt tgccccgcct ctccttataa agctcctgga agccattgag 420 aagaaaggac tggaatgttc aactctatac agaacacaga gctccagcaa ccctgcagag 480 ttacgacagc ttcttgattg tgatcccccc tcagtggact tggatgtgtt cgatgaacac 540 gttttagcag atgctttcaa acgctatctc gccgacttac caaatcctgt cattccagta 600 gctgtttaca atgagatgat gtctttagcc caagaagtac caagctccga agactacatc 660 cagctgttga agaagctcat taggtcgcct aatatacctc atcagtattg gcttacgctc 720 cagtatttgc tcaagcactt cttcaagctc tctcaagcct ccagcaagaa ccttctgaat 780 gcaagagccc tctctgaaat tttcagccac gtgcttttca gattcccagc agccagctct 840 gataatactg aacacctcat aaaagcggta gagcttttaa tctcagcgga gtggagtgag 900 cgacagccag caccagcact gccccctaaa ccacccaagc ccacttctat agccaacaac 960 agcatgaaca acaatatgtc cttacaggat gctgaatggt actggggaga tatctcaagg 1020 gaagaagtaa atgaaaaact ccgagacact gctgatggga cctttctggt acgagatgca 1080 tccactaaaa tgcacggnga ctacactctc acactaagga aaggaggaaa taacaaatta 1140 atcaaaatct ttcaccgaga tgggaaatac ggtttctctg atccattaac cttcaactct 1200 gtggttgaat taataaacca ctaccggaat gagtccttag ctcagtacaa tcccaagctg 1260 gatgtgaagt tactctaccc agtgtctaaa taccagcagg atcaagttgt caaagaagat 1320 aatattgaag ctgtggggaa aaaattacat gaatataata ctcaatttca agaaaaaagt 1380 cgggaatatg atagattata tgaggaatac actcgtactt cccaggaaat acagatgaaa 1440 agaacagcta tcgaagcatt taatgacacc ataaaaatat tcgaagagca gtgccacccc 1500 caggagcggt acagcaaaga ctacatagag aagtttaaac gcgaaggcaa cgagaaggaa 1560 attcaaagga taatgcataa tcacgataag ctgaagtctc ggatcagcga gatcatcgac 1620 agcaggagga gactggagga agacttgaag aagcaggcag ccgagtatcg cgagatcgac 1680 aagcgcatga atagcattaa gccggacctc attcagctga gaaagacaag agatcaatac 1740 ttgatgtggc tgacgcagaa aggggtccgg cagaagaagc tgaacgaatg gttgggaaac 1800 gaaaacacag aagaccaata ctcactggta gacgatgacg aggatttgcc ccaccatgat 1860 gagaagacgt ggaatgtggg gagcagcaac cgaaacaaag ccgagaacct attgcgaggg 1920 aaacgagatg gcactttcct ggtccgggag agcagtaagc agggctgcta tgcctgctct 1980 gtagtggtag atggcgaagt caaacattgc gtcatcaaca agactgccac cggctatggc 2040 ttcgccgagc cctacaacct gtacagctcc ctgaaagagc tggtgctaca ttaccaacac 2100 acctccctgg tgcagcacaa tgactccctc aatgtcacac tagcataccc agtatatgca 2160 caacagaggc gatga 2175 23 2116 DNA Rattus rattus 23 cgcaactgag aagactggat agagccggcg gagccgcgaa cgagcagtga ccgcgctccc 60 acccagctct gctctgcagc tcccaccagt gtctacccct ggacccctcg ccgagctttg 120 cccaaaccac gaccatgatg ttctcgggtt tcaacgcgga ctacgaggcg tcatcctccc 180 gctgcagtag cgcctccccg gccggggaca gcctttccta ctaccattcc ccagccgact 240 ccttctccag catgggctcc cctgtcaaca cacaggactt ttgcgcagat ctgtccgtct 300 ctagtgccaa ctttatcccc acggtgacag ccatctccac cagcccagac ctgcagtggc 360 tggtgcagcc cactctggtc tcctccgtgg ccccatcgca gaccagagcg ccccatcctt 420 acggactccc caccccgtcg accggggctt acgccagagc gggagtggtg aagaccatgt 480 caggcggcag agcgcagagc atcggcagaa ggggcaaagt agagcagcta tctcctgaag 540 aggaagagaa acggagaatc cgaagggaaa ggaataagat ggctgcagcc aagtgccgga 600 atcggaggag ggagctgaca gatacgctcc aagcggagac agatcaactt gaagacgaga 660 agtctgcgtt gcagaccgag attgccaatc tactgaaaga gaaggaaaaa ctggagttta 720 ttttggcagc ccaccgacct gcctgcaaga tccccaatga cctgggcttc ccagaggaga 780 tgtctgtgac ctccctggac ttgactgggg gtctgcctga ggctaccacc ccagagtctg 840 aggaggcctt caccctgcct cttctcaatg accctgagcc caagccatcc ttggagccgg 900 tcaagaacat tagcaacatg gagctgaagg ctgaaccctt tgatgacttc ttgtttccgg 960 catcatctag gcccagtggc tcggagactg cccgctctgt gccagatgtg gacctgtctg 1020 gttccttcta tgcagcagac tgggagcctc tgcacagcag ttccctgggg atggggccca 1080 tggtcacaga gctggagccc ctgtgcactc ccgttgtcac ctgcactccc agctgcacta 1140 cctatacgtc ttcctttgtc ttcacctacc ccgaggctga ctccttccct agctgcgcag 1200 ctgcccaccg aaagggcagc agcagcaacg agccctcctc tgactcactg agctcgccca 1260 cactgctagc cctgtgagca gtcagagaag gcagggcagc cggcactgac tgagctggtg 1320 cattacagag aggagaaaca cgtcttccct cgaggggttc ccgtagacct agggaggacc 1380 ttatctgtgc gtgaaacaca ccaagctgtg gacctcaagg acttgaaagc atccacatct 1440 ggactccagt cctcacctct tccggagatg tagcaaaaaa acaaaaaaac aaaacaaaaa 1500 aaaaacaaaa caaaaaatca aaagcaaccg catggagtgt attgtttgta gtgacacctg 1560 agagctggta gttagtagca tgtgagccag gcctgggtct gtgtctcttt tctctttctc 1620 cttagtcttc tcatagcatt aactaatctg ttgggttcat tattggaatt aacctggtgc 1680 tggatatttt tcggattgta tctagtgcag ctgattttaa caatacctac tgtgttcctg 1740 gcaatagtgt gttccaattt agaaatgacc aatattaaac taagaaaaga tagaacttta 1800 ttttccggta gatagaaata aatcgctata tccacgtact gtagctcttc agcgtccatg 1860 ttcattgtca tgtaactgat catgcattgt tgaggtggtc tgaatgttct gacattaaca 1920 gttttccatg aaaacgtttt attgtgtttt caatttattt attaagatgg attctcagat 1980 atttatattt ttattttatt tttttctatc ctgaggtctt tcgacatgtg gaaagtgaat 2040 ttgaatgaaa aaattttaag cattgtttgc ttattgttcc aagacattgt caataaaagc 2100 atttaagttg aatgcg 2116 24 617 DNA Rattus sp. 24 gatataaata agtttattgc tggatttcct cattaacatt atagaaatat tttaaaatca 60 ctaaaagtca caaattgaga gcccaacagg taaccatact ttataaaaaa gaaagtataa 120 aagcggctta taaaattttc tgaaaagtat atcagctgac agcaaagaaa taggaaccag 180 tcagtaaatg gcacaaatac atcactgaag ttctcagtcg tcacctacaa gtcagtgtct 240 gagctatgaa gtagacctgt tttataagtt acacactgag tagtgttccg aactgtcctt 300 cctcacttcc ataagagaag ccaaatcttt tactgctatg gggacaaagt actatccata 360 aactggagaa gaaggctcac cttcctctcc cagtacctga acaacagaac agtgtgcaca 420 gaaacggctt tggcacttga accctcagac actgtcttaa acgttgttgc ttgaatattg 480 tagttttacc tggtgctctg acctacattg ttgtctccat gctctgtgga tatcacacac 540 acacacacac acacacacac acacagatat aaacagccaa ggcactcata cacataaaag 600 taacaataat aattttt 617 25 7204 DNA Rattus norvegicus 25 cgcaaagttg ggcagggagc tgggggagga ggaccgcgag cgagggtggc ggagcagggg 60 cgggaggagc ggagggagga ggggaccgga gcgtgtcact cgcgcgctcc ctctgtgcac 120 agaggatgtg ctgaatggtg cgcttccagg cggcggccga gcaggatcag gcggcgggcg 180 gctcgcactg ccgggctctg ctccttcttg tctcccgagg ctctacaatc accgcgggct 240 ccagaccctg cgtcccgccc ggggcatggc aggctgctgt gccttgcggt gtagtcccgc 300 ttactaagcg gcgcgggcca gaggtgcgga ggccaagagg ccgggaggtc ggcgggcagc 360 ggaggcaaga gcctgctgaa ccgagagcca agcccgcttg cgcccggagc tccgtgtccc 420 gtctccactg cgctcgcccg ggccgcccgg agccccgatg agcccagatg gctggggctc 480 agcccggagt gcacgccttg caactcaagc ccgtgtgcgt gtccgacagc ctcaagaagg 540 gcaccaaatt cgtcaagtgg gatgatgact ccactatagt tactccaatt attttgagga 600 ccgatcctca gggatttttc ttttactgga cagatcagaa taaggagacg gagctgttag 660 atctcagcct cgtcaaggat gccaggtgtg ggaagcacgc caaagctccc aaggacccca 720 agttacgtga acttctggat gttgggaaca tcggacactt ggaacagcgc atgataactg 780 tggtgtatgg gccagacctg gtgaatatct cccacctgaa tcttgtggct tttcaagagg 840 aagtggccaa ggaatggaca aatgaggttt tcagtttggc aacaaacctg ctggctcaga 900 acatgtccag ggacgcattt ctggagaaag catatactaa gctcaagctt caggtgaccc 960 cagaagggcg cattcctctt aaaaacatct atcgactgtt ttcggcagac cggaagcgag 1020 tggaaactgc gctagaggct tgtagtcttc catcgtcaag gaacgattcc atccctcaag 1080 aggactttac tccagatgta tacagagttt tcctgaacaa tctttgtccc cgacctgaaa 1140 ttgataacat cttctctgaa tttggtgcca aaagcaaacc gtaccttact gttgatcaga 1200 tgatggattt tatcaacctt aagcagagag atccccggct gaatgaaata ctttacccac 1260 ctctgaagca agagcaggtc caagtgttga ttgagaagta cgagcccaac agcagcctcg 1320 ccaagaaagg gcagatgtca gtggatggat tcatgcgcta cctgagcgga gaagaaaatg 1380 gagtcgtttc acctgagaaa ctggatttga acgaagacat gtctcagccc ctgtctcact 1440 atttcatcaa ttcctcacac aacacctacc tcacagctgg ccagttggct gggaactcgt 1500 ctgtagagat gtatcgccag gtgcttctgt ctggatgtcg ctgtgtggag ctggactgct 1560 ggaagggcag gaccgctgag gaagagcctg tcatcaccca tggattcacc atgacaacag 1620 aaatatcctt caaggaagtc atagaagcca tcgcagagtg tgcgttcaag acttctcctt 1680 ttcccatcct cctttccttt gagaaccatg tggattcccc gaagcaacaa gccaagatgg 1740 ccgagtattg ccgattaatc tttggtgatg ccctccttat ggagccactg gaaaaatacc 1800 cactggaatc tggggtacct cttccaagcc ctatggattt aatgtataaa atcttggtga 1860 aaaacaagaa gaagtcacac aagtcgtcag agggaagtgg taaaaagaag ctctctgagc 1920 aagcttccaa cacgtacagc gactcttcca gcgtgttcga gccttcgtct ccgggagctg 1980 gggaagcaga tacggagagt gatgacgatg acgacgatga tgactgtaaa aagtcttcca 2040 tggatgaggg gactgctggc agcgaggcca tggccacaga agagatgtct aacctggtga 2100 actatattca gcctgtcaag tttgagtcct ttgaaacttc aaaaaaaaga aataaaagct 2160 ttgaaatgtc ttccttcgtg gaaaccaaag gactcgaaca actcacgaag tctccagttg 2220 aatttgtcga atacaacaag atgcagctta gcaggatata tcccaaagga acacgcgtgg 2280 actcatccaa ctacatgcct cagctcttct ggaatgctgg ctgtcagatg gtggcgctca 2340 acttccagac agtggatcta gctatgcaga taaacatggg catgtacgaa tacaatggga 2400 agagtggcta caggctgaag ccagagttca tgaggaggcc agacaagcat tttgatccat 2460 ttactgaagg aatcgtagat gggatagtgg ccaacacttt atctgttaag attatttcag 2520 gtcagtttct ctctgataag aaagttggga cttatgtgga agtggatatg tttggtttgc 2580 ctgtggacac aagaaggaag gcatttaaaa ccaagacatc ccaaggaaat gctgtaaatc 2640 ctgtctggga agaagagcca attgtattca aaaaggtagt tctgccttct ctggcctgtt 2700 taaggatagc agcatatgag gaaggaggca aatttattgg ccaccggatc ttgcctgtgc 2760 aagcaattcg gccaggctat cactacatct gcctgcggaa tgagaggaac cagcccctga 2820 tgctgccagc tgtctttgtc tacatagaag tcaaagacta tgtcccagac acgtatgcag 2880 atgtaattga agcattatca aacccaatcc gatatgtcaa tctgatggaa cagagagcta 2940 agcagttggc tgcattgaca ctggaggatg aagaggaagt caagaaggag gctgaccccg 3000 gggaaacgtc atccgaggct ccaagtgaaa ccaggacaac tccagcagag aatggggtga 3060 atcacaccgc aacccttgca cccaagccac cttcccaggc tccacacagc cagcctgctc 3120 cagggtctgt gaaggcaccc gccaaaacag aggatctgat tcagagcgtg ttaacagaag 3180 tagaggcgca gaccattgaa gagctcaagc aacagaaatc gttcgtgaaa cttcaaaaga 3240 agcactacaa agaaatgaaa gacctggtca agagacacca caagaaaacc accgagctca 3300 tcaaggagca taccaccaag tacaatgaga ttcagaatga ctacttgaga aggagggcag 3360 ccttggaaaa gtccgccaaa aaggatagca agaagaaatc tgaacccagc agcccagatc 3420 atggctcatc cgccattgag caagacctcg cggccctgga tgcagaaatg actcagaagt 3480 tgatagactt gaaagacaag caacaacagc agctgcttaa tcttcggcaa gagcagtatt 3540 acagtgagaa gtaccagaag cgggagcaca ttaaattgct cattcagaag ttgacagatg 3600 ttgctgaaga gtgtcagaac aatcagttga agaagctgaa ggaaatctgc gagaaagaga 3660 agaaggaatt aaagaagaaa atggataaga agaggcagga gaagataaca gaagccaagt 3720 ccaaagacaa aagccagatg gaagaggaga agacagagat gatccgatca tacatccagg 3780 aggtggtgca gtacatcaag aggttagagg aagcacaaag taaaagacaa gaaaaacttg 3840 tggaaaaaca caaggagatc cgccagcaga tcctggatga gaagcccaag ggggaaggct 3900 cctcctcagt cttgtcggaa agttgccatg aggatccctc tgttcccccc aactttactc 3960 cccccaaccc tcaagctctc aagtggtgag caccgtcctt ctagccagct gcagatggag 4020 ctggagcaag aataccaaga caagttcaaa agactgcccc tggagattct ggagtttgta 4080 caggaagcca tgaaagggaa ggttagtgag gacagcaatc acggctctgc ccctccctcg 4140 ctggcctcag accctgctaa ggtgaacctc aagtctccct ccagtgagga ggtacaagga 4200 gagaacgcgg gaagagagtt tgatactcct ctgtgatgtc cctgccgggc ctaccagaca 4260 tgcacggctg cttgaactcc atcggactct aaagacaaag atcactgccc gggccatctt 4320 cctgagaaac atcccttagc ctgaaatcca caccaaaggg agagttccag aaggatccgt 4380 gtgaaggtcc catacccttg tcccttgtgt catgtggaaa ctattgtggt cttagagaga 4440 agggtgcatg tatgcaggat ttttcttttc gttccaatag taaattaaaa gcaggcagct 4500 ccaggctcca tggactatgt aatgaaggac aatgtcttct ttgaagaaaa ctagagctcg 4560 tgtcttcgtt tgaagccctg gtgtacagta tttccaagta aaagagagag tttgagaagt 4620 gcgcggcacc atttaacact ctggaacatt ccactctgag cattgtttcc ctgactgccc 4680 cgcaaaccca tgttttcaag ttaacgtgta tagtgcattg tttcaccctt tgctttgcag 4740 gcactggtgg cttgccattt gctaatttat ttatcccaga ggcatctgta tttgctattg 4800 acatggcttt attagatacc atagtgactc atataagctg gtttctttta taaaaaaaaa 4860 tcacatgact gtatcatttc ccagtgaagc cattttacga ttagcaatat gggttgtata 4920 tttggctgca cgtctttgga tatctatcgg ttgttgacct gaggctatca aacagctgca 4980 atgggtttgt tctgttgagt cagggtttcc gccagactga acaggatcca gtgggctatt 5040 ttattgtcag atattttttc gtggttcatt tatttttact gtagaaagga agatagaata 5100 tttatctaaa tgcacatgta taaatgatat acattatctc catgtatata gtacatgtat 5160 gcacccagcc atataagtgc acacacatac atacgcatga gtgtacatgt gtgtttatta 5220 attggcagtg acccaaatct cttccataag atttaaaagc aagttcaggg atacatggac 5280 agagaagaaa tgggtcgaat atatgttatg tggataatta aattatacgg aaatgctaag 5340 aatcagttta tggagtcaat gatgcaactt gaggtaatct tagaatttag aaaatgagtt 5400 tactaaaaaa attggaggat caaaaattgg attttaagta tcagattttg agcttgtttt 5460 aatggtcagt gtatgcaaaa atgcatagga aatagagtgc aaatgtatta ctactatctt 5520 caataaattt taactgaata atttcaattt tatactgata gaaataaagc tctggttgta 5580 ttttagtttg tatatttact tcacttagtt atgtcttttt taattcttat tttattctaa 5640 agtgaataag agaaacaaat tatatcatgc taacgctaat aaaatcataa ttaaaaggtc 5700 ttaagaaata atattatata gaaaaaatat ggtttagaga gtgggacaaa gaacattaaa 5760 acccagcaat gacaatgccc atggcttcat tttattttaa agtgaacctg tctgcttata 5820 atttagcaaa gcaaaaaaca gttgtgtcct gaagtttgtg tttcaagaat ttagtatttt 5880 tctggaaata attttattta actttaagca ataactagga ccataattaa gtgttaatca 5940 aaatgaaggc ttgtttcaaa caccatgaaa gtgtttgatc acacacatac accacacaca 6000 cacacacaca cacacacaca cacacacaca cacacacaca cctattgaga cacaagggga 6060 aaaaaatcac atccccttca gagatgattg tattttatct gaaggatcaa gagtgttaat 6120 tagtcactta ctaaagctta ccttccactg aagagagatc tcttgtgaac catacaactt 6180 ttgcactttg aaggaaaggc attactcaga aggaagtgag gacagtccaa atggaatgct 6240 gcattttaca cacacagtta cagactgttg aaaattgggg aagagtgaat ggataacagg 6300 attcgaaacc aaacttagtg gtggttgttg cggaacattg gatttcaaca tactccatac 6360 agcgactgag gttgaataac cttgactttc aaatcctgtt ttggtagttg gacttcatta 6420 tcttggtgat tctagtcatt ttacaatgtt ttgtatttgg tcatttactg taatcacatt 6480 tttatatctg tacagtgaca ctttttgcag ttgtggggta gtgtgtaaca ctgtgcatct 6540 tgcatcatcg aaactactac cgtgatacta tccattgata atattaatat tacttgaaaa 6600 atgacaaagg taaagaaaag gggtctgtat gatgtgcagt tttgtgcctt tatgtatttg 6660 ccttgttctt cgttgaatgt gtgaaattct gtactgtggt ttttccaata gtagaaagta 6720 gagtcatgca ttaaattaga ctgtatccct gacaccttta aactactgag aataacgtgg 6780 ttggccgtgt aattcagtgt tcaaagttct aatgacatgc catgtgcttt ggtttttaac 6840 atttcatgac caccatacat taattaatac tcctgtaata gataagcagt cattaattaa 6900 gttccaaaag aaagggccat tgcttgcatt cctttgaatt taatgttgcc cttgtacact 6960 gtgttaatac tgtatgtaat ggattgaaca ttgtgattct cgccttttaa gaagagaaag 7020 agagagaagg aaaagtattt gatgctctta aaatgtacat atttgggttc ttctatctca 7080 aattatttaa aatgcataat tcacattttt ttgtaatcat tctatgcaat tttgtggcat 7140 gacgtttctt ccacttgtaa ttttctgtgc tttcatcaca agtccaaagg aaacaataaa 7200 aatt 7204 26 3700 DNA Rattus norvegicus 26 ccaagaggaa gaaacatgaa gtttttgctg cttagtgcac ttttattttt gcatagttcc 60 ttagcttgga caagagaaaa gcattattac atcggaatta ctgaagcagt ttgggactat 120 gcttctggca gtgaagaaaa ggaacttatt tcagttgaca cggaacagtc caatttctat 180 cttcgaaatg gtccagatcg tattggaaga aagtataaga aggcccttta ttctgagtac 240 acagatggca cctttacgaa gactatagac aaaccagcct ggctagggtt tttaggccct 300 gtcatcaaag ctgaagttgg agacaaagtt tctgttcacg taaagaactt tgcctctagg 360 ccctacactt ttcatgctca tggggtaact tacaccaagg cgaacgaggg ggccatctac 420 cctgacaaca ccactgattt tcaaagagcc gatgacaaac tgtttcctgg acagcagtat 480 ttgtacgtgc tgcgtgccaa tgagccaagt cctggcgagg gagacagcaa ttgtgtgacc 540 aggatttacc actctcatgt ggatgctcca aaagatattg catcaggact cataggaccg 600 ttgatactct gtaaaaaagg ttctctgcat aaggaaaaag aggaaaatat tgaccaagaa 660 ttcgtactga tgttctctgt ggtggatgaa aatctcagct ggtacctaga agataacatc 720 aaaaccttct gctctgaacc agagaaagtc gataaagaca atgaagactt ccaggaaagc 780 aacaggatgt actctataaa tggatataca tttggaagcc tcccagggct ctcgatgtgt 840 gcagaagaca gagtgaagtg gtaccttttt gggatgggga atgaagttga cgtgcattca 900 gagctctttc atggtcaagc cctgaccagc aagaactatc atactgatat aatcaacctg 960 ttccctgcca ctctaattga tgtttctatg gtggcccaga atcctggagt ctggatgctc 1020 agttgccaga acctgaacca tctgaaagct ggtttgcagg cctttttcca ggttcgtgac 1080 tgcaacaagc cctcaccgga cgacgatatc caagacagac atgtgagaca ttattacatc 1140 gctgccgagg agaccatttg ggactatgct ccgtctggga cagacacctt cactggagag 1200 aacttcacca gtctgggaag tgattcaagg gtcttttttg agcaaggtgc tacaagaatt 1260 ggtggctctt ataaaaaatt ggtttatcgt gagtacacag atgattcctt cacaaaccgg 1320 aaggaaagag gccctgatga ggaacatctt ggaatccttg gtcctgtcat ttgggcagaa 1380 gtaggagaca tcattagagt cacctttcat aacaaaggac aatttcctct cagcattcag 1440 ccaatggggg taagattcac caaggaaaat gagggaacat actatggccc agatggccgt 1500 tcctcaaagc aagcctccca tgtggctccc aaagaaacct ttacgtatga atggactgtc 1560 cccaaagaaa tgggacccac ttatgcagat cctgtgtgcc tatctaagat gtattattct 1620 ggagttgacc tcaccaaaga tatatttact gggcttattg ggccaatgaa aatatgcaag 1680 aaaggcagct tacttgcaga tgggagacag aaagatgtag acaaggagtt ctacttgttt 1740 gcaacagtgt ttgatgagaa tgagagttta ctcttggatg ataatatcag aatgttcaca 1800 actgcacctg agaatgtgga caaggaagat gaagactttc aggagtccaa caagatgcac 1860 tccatgaatg gattcatgta tggcaatctg cctggcctca atatgtgcct aggagaatcc 1920 atcgtgtggt atttgttcag cgctggaaat gaggcagacg tgcatgggat atacttttca 1980 ggaaatacct atctgtccaa aggagaaaga agagacactg caaatctgtt tcctcataaa 2040 agtctcaccc ttctcatgac acctgacaca gaagggtctt ttgatgttga gtgtcttaca 2100 acagatcact acaccggcgg catgaagcaa aagtacactg tgaaccagtg caaggggcag 2160 tttgaagatg tcactctcta ccagggagaa aggacctact atattgcagc agtggaggtg 2220 gaatgggatt attcaccaag cagggactgg gaaatggagc tgcaccattt gcaagagcaa 2280 aatgtttcaa atgcattttt ggataaggaa gagtttttca taggctcaaa gtacaagaag 2340 gttgtgtatc gagagtttac tgacagcaca ttcagagaac aggtgaagag aagagctgaa 2400 gaggagcact tgggcatgct cggtccactg attcatgcag atgttggagc caaagttaaa 2460 gttgtcttta aaaatatggc aacaaggcca tattcaatac atgcccacgg agtgaaaaca 2520 aagagttcta cagttgctcc aacgttgcca ggtgaagttc gcacttatat atggcaaatt 2580 ccagaaagat caggtgctgg aacggaggat tcaccttgta tcccatgggc ttattactca 2640 accgtggatc gagttaagga tctctacagt gggctaatag gcccattgat tgtttgtcgg 2700 aaatcttatg tgaaagtatt caatcctaaa aagaaaatgg agttttccct tttgtttcta 2760 gtttttgatg agaatgaatc ttggtactta gatgataaca tcaatacata ccccgatcac 2820 cctgagaaag ataacaaaga caacgaggaa ttcatagaaa gcaataaaat gcatgctatc 2880 aatgggaaaa tgttcggaaa cctccaaggt ctcacgatgc acgtgggaga tgaggtcaac 2940 tggtatgtga tggctatggg caatgaaata gacctgcaca ctgtacactt ccacggccac 3000 agcttccaat acaagcacag gggaattcat agttctgatg tctttgactt tttccctgga 3060 acataccaaa ccctagaaat gtttccccaa acgcctggaa cctggttact ccactgccat 3120 gtgactgacc atattcatgc ggggatggta actacctaca ctgttttacc aaatcaagag 3180 actaagtctg gctgaaagaa ataaattggt gataagtgga atacgagcac aatgacgttg 3240 ttttaaacat ttaaaaaaat caaagccaca caaatgttca tttgtgaggg aattggtaat 3300 gccgatggac agatgaacag actgtatcat gacatgtatt tgtttgctgg gtaacagaat 3360 cgctttacat agtccactta cacctgcact gaaaggactc tgaaaagtgg aaaaaaataa 3420 gcaaaaccgt atgatcagat gctgtccttg actgtcctca caggatcact ataaagtcca 3480 ctaaactgtc tccaactctt ctcatcaagt cctctaacaa accatggggt aagagggtat 3540 agaaaagaag gaaagatgaa gataccaaga tgcactttgt aaaaatctga aaaacagttg 3600 aaggatgctc tcggaaaata gagaaagtca ggatccaatt atgttacatt ttgaaaaaat 3660 gaaatggaga taataaagta ataaatttta aaatgccaat 3700 27 4011 DNA Rattus norvegicus 27 ggatccagca tgtcttggct ctgtgggcat ctacactcgc atgcacatat ccacacagag 60 acataaaatt tacaataata aaaataaatc taggggttgg ggatttagct cagcggtaga 120 gagcttgcct agcaagcgca aggccctggg ttcggtcccc agatccaaaa aaaaagaacc 180 aaaaaaaaaa aaaatcttaa aatgctcttt caggctgcca taatggcctt taatcccaga 240 agcatccggt cccctgcagg tggcagaaca gggactgcca cttctttgga ggacaatttg 300 tcagctgtct gtcaaaactt gaatatcctg tagcttagac ttacatgtgt gagaatttat 360 cccggatgag ttcatattca tagttaaggg acagcaacac caacattttt ttaaaaagca 420 gaacgttgtg ttcgtaaata ggaaccaatc cgaggctaat tcagggtctt cctgagaact 480 attcagcctt agaaagtaac aaactaggtc tctgtcttca gacacgatgc aatctttcag 540 ctaaagtaaa aagaaaattt ccagaacagt ttgatgtatg gacaaaaata ttttcccaaa 600 aactacttaa gaaatgtcaa catggctacc tcaagaaaaa aaaatgagaa acctttaatt 660 ttcagaatgt actttcagct tggcatggtg acacccacct ttactctcag ggctcaggag 720 gcagagacag aggctggtgg atctctataa gtacagggcc atcctggtct acactgtgag 780 ttccaggtca tccagaactc catgtgagac catgtctcaa aaaagaaagg aatgctcttg 840 tatactagtt gattccatcc acctccatta tcaaatcttt cattatgacg attttttgtt 900 tttccaaagc acagagtaaa atgttaaaac aggaacacac acacacacac acccctaaat 960 aaaggtgatc tttaaggacc tgtttatctc agacccccaa tgcaatcaca tctagccttt 1020 tctgtggatg gtttcaaact cagatcctca tgcaagtcaa tcctgcttgc acctcatgtg 1080 ctcttaacca cttcagggtt cctaagtgtg ccggggactt catgttatct ggttatgtct 1140 ctccttagca agcgcacagg tgaagaaact ggtgttaggt ggaaacgaac gttttaaatg 1200 ggaaacagag gactcgaact agtgattcta cggactggag ggcgccgcgc atcccgggct 1260 ctcacatggc cctcagcccc tgcggggcgc acccgtggcc cgacgcgcac gaggctccac 1320 gctctgggga ctcgcgcgca cgtagcggcg cgagcccggc ggcggcagcg acgtcacagg 1380 ccgagctttc cttttcggga gtccccggca tacatcctgt gtccatgttt ggtcatttac 1440 gtcacggcgc cagggccggg gcctcccgaa atggcagtag cccggggagt cggaagcccg 1500 gagccagcgc cgccgcagct atataagtgg gggaactgta ggttggggga gcccggttgc 1560 actttggaga aaccaggagc cgccgcccga ggcaggtgcg ggcgagccag ggcgccgcgg 1620 ccccccaacc ctcgtccccg agccaagcgc cgggagcctg gagctggggc gcccggattc 1680 cctgccagcc acgcgcgccc cagcccagcg aggccccggg cgccccgccg ccaccacgcc 1740 atgctccacc tgagcgactt ctccggcccc gacgcgctcc tctccaagcc caccgaaggc 1800 tgcgcccaca ccagccccga gctgccccgg ctgcctgcta gggacgctcc ctcggccgcc 1860 gcgtatcctg gaggtaagga gcggggctag gagctcgaag aagccgaatg cgacgagggg 1920 tgatgggaga ggcgtgcggg ggtggggtgg gggtccccaa gcgcgcggta ccattcccac 1980 taaggccaca ggcgtctgtg tgggagcccg tgcgtatgcg cgccctgggg cgtccgtggg 2040 ggtcccctgc gctgcaggat ccgaagtgca cgaggcgctc tgcgggtggc ttgcgccagg 2100 agggctcggg tcgagggact cgcggctgca ggctccgagg cacgcccttg gatgtgtgcg 2160 gggctgatgc gtggtccctc ctttggcagg cgacttcttg agctgggctc tgagcacctg 2220 cggcgccggg ggggacttaa cagactcctg cttcctggag ggccctgcac ccacgccccc 2280 ttcgggcctc agctacagcg gcagcttctt catccaggcg gttcccgaac acccgcacga 2340 cccggaggcc ctcttcaacc tcatgtctgg catcttgggc ttggcaccct tccctagccc 2400 cgaggcggca gcgtctcggt cccccctgga tgtccctttc cccgcgggtc ccgatgcctt 2460 gttgccggac ctttactccc cggatctgag ttcggccgcc ttcccggagg cgttttggga 2520 ggccgcgcct tcggcgggcg ctccctcgca gtgcctgttc gagccccagc tctccccgcc 2580 cgacgtcaag cccgggctga gggcgcctcc cgcttcgcca gcgctggacg ctgctgcttc 2640 ggccttcaaa gggccctacg ccccctggga gctgctgtcg gccggggctc cggggaactg 2700 tgggtcgcag ggaagcttcc agaccacccc ggaggcacgc ttttccgccg tggggaccaa 2760 ggtcgaggac ctgctgtcca tcagctgccc cgccgagctg cccggtccgg ctagcagact 2820 ctacccgcca ggggcctacg atgccttctc gctggcccca ggtgacttag gggaggggac 2880 cgagggcctc ccggcgctgc tgacccctcc gggcggggag ggagggagcg gcggcgaagg 2940 cggagagttc ctggccgtcc ctcaagcgca gctgtccccg ctgggcctgc gcggcgccgc 3000 cacggcagac ttctccaaag ccctggtggc cgacctcccg gggggcagcg gagtggcggc 3060 gccttcatcc cccgccacct ccttccccgc ggccaaagcc cggcgcaagg gacgccgggg 3120 cggcaagtgc agcgcgcgct gcttctgccc gcggccgcac gtcaaggcct tcgcctgccc 3180 cgtggagagc tgcgtgcgga cgttcgcgcg ctccgacgag ctcaaccgcc acctgcgcat 3240 ccacacgggc cacaagccct tccagtgccg catctgcctg cgcaacttca gccgcagcga 3300 ccacctcacc acgcacgtgc gcacccacac cggcgagaag cccttcgcct gcgacgtgtg 3360 cggccgccgc ttcgcgcgca gcgacgagaa gaagcgacac agcaaggtgc acctcaagca 3420 gaaggcgcgc gccgaggagc gcctcaaggg cctgggcttc tactcgctgg gcctctcttt 3480 cgccgcgctg tagccggagc tggctccgta ggtcggcgcc gccggccgtc ggcgcacgcg 3540 acacggtcct gccgctccct cgtccctgct gccttccctg cctcttccac gcacgtccgg 3600 ggccacccgc agcccagctc cagttccccc gaagcgcccg ccgctcacgc ccttcagcac 3660 gggctccgcg gacagcgccc gctgttttcg gagccgcctt cctctagcca cccgctctgg 3720 ggactgtcct ctcggtccac ccacagagca ggcgatacct taggactgaa gagagttttt 3780 gtaactggcg tacgccccac gccttcctct ttatcccttc ccagagtcaa gctggggatg 3840 taccgagccg gtctctcaag aactttgtac agcaagtcca gcaagccttt ggatgtgatg 3900 tctttgcttt ggggttattt cctttttgtt gtcgttcatt ttttgtaaag cagacgctac 3960 tctcaagcat ttgacaaaac tgtttatttt tgcaattaaa attattgtgc t 4011 28 256 DNA Rattus norvegicus 28 ctgcccttga ctcctagact ccaggatgcc gggaccttgg ttgctgctgg ctctggcttt 60 gatcttcacc ctaactggta tccctgaatc ctgcgccttg ccggaggcag cccaggagga 120 aggtgcagtg actcctgacc ttcctggcct ggagaatgtt caggtccggc cagaacgtcg 180 attcttgtgg aaagacctcc agcgggtgag aggggacctc ggtgctgcct taggtaagca 240 ccaggggagg ggagcc 256 29 8797 DNA Rattus norvegicus misc_feature (1)..(8797) where n may be a or g or c or t/u, unknown, or other 29 agaagcaggg caagatggag cggagacgta tcacctctgc acgccgctcc tatgcctcct 60 ccgagacgat ggtcaggggc catggtccta ccagacacct gggtaccatt ccgcgcctct 120 ccctgtctcg aatgacgcct ccactccctg ccagggtaga cttctccctg gccggggcgc 180 tcaatgccgg cttcaaagag actcgggcca gcgagcgcgc ggagatgatg gagctcaatg 240 accgctttgc tagctacatc gagaaggtcc gcttcctgga acagcaaaac aaggcgctgg 300 cagctgagct gaaccagctt cgagccaagg agcccaccaa actggctgac gtttaccagg 360 cagaacttcg ggagctgcgg ctgcgtctgg accagcttac taccaacagt gcccggctgg 420 aggtggagag ggacaatctc acacaggacc tcggcaccct gaggcagaag tgagaagggg 480 aataggaaaa tggctagtga gcagagagac tgaattgaga gaggagcctg gaagaagggt 540 actgcctccc cctaaggaga gcctcttagc tgctgtgttt caaaagggag agctgtcttt 600 gttccctaat atgatgcaga atagccagga agagactgaa aacttcaggg atcagattcc 660 cagagagagc tgagcattct gggtacttac caccaagctg agggaggact gtgtgtgggt 720 agccagggta caagcaggca ggtgcttcag tcactgggca ctatgaatgg ccagagaggt 780 ttgcgtccac aaggagggct cttggtctct gaaatataag aaatgcaaga aagagcaggc 840 aggcctggat gccgggccca gatggctgtc tacgacctgc ttttgaccct gctcaagtca 900 ttttgattgg ttgtagctgt aaaggagagg tcatgataca atctgtccta aaagatggta 960 accgaggtca aactctgggg cgtggtgctt tgctgggtgg tggatattca cgtgtccctc 1020 tggtgcagga ggggcactgg tgtcagtgac agtgggcgca cggtggtgga tgcgcctctg 1080 ggaacaagag caggtggcac caaccccgat aaggcacctc agtaatgagt taaggaggaa 1140 ggctcttatc tgttcacgcg tcaccaggcc tctgtgtagg aacctgccat tgcttactct 1200 tcctgccact ctaccgatcc tgtgggatcc gtttgtaaca rtcccctccc tctggaggtt 1260 cggataggga cccaagagtc ctcaggaaaa cctaaagcct gtactgtgtt ccctgtccca 1320 ggctccaaga tgaaaccaac ctgaggctgg aggcggagaa caacctggct gtgtacagac 1380 aggtcagaga gatgagagga agggtgggga gggaaggcgg aggcccgcca ccaaacaccg 1440 cagttgtctg cttccaagat ccccacaagg agggggaaga caaagttgga aattttagta 1500 gctaggaccc tggggtcaaa gagattctct ccctgctcca ggaggcggat gaagccacct 1560 tggctcgtgt ggatctggag aggaaggttg agtcgctgga ggaggagatc cagttcttga 1620 ggaagatcca tgaggaggtg aggccaggga gggggaggga ggggccttac gagaaactgc 1680 aggtaataga cagagacgct cacagaggna aacngagaaa ccaaatttca ggaagaaaac 1740 aaaaacaaga gaaagagtna agacagagac ttagggagat gtgtccagta tttactaccc 1800 gtgnaaacac acgcaggcct tccgccgcca tgttctaaga agcacagaac cgagtctgag 1860 tttctccttc acatcagctc tcagccttat cactacctct tccacaactg tagggacctt 1920 gaggcctaga gagtgaaaca cacccaaagt cacacagcta gaacccctct agacctgtca 1980 cagtggtatg catctcagcc tgtgagtggt agacgcggaa cgtttgaact gttcaaggcc 2040 aatttcwgcn acannaghaa actgaaacca gccacgcncc gcncgcgcac atctattctc 2100 ttacctcaga acctngctgc aagacatatt yattcatgcc tggttcctga ctatgcagtc 2160 ctgagaggaa ggcacctcaa attctacttc catgcccatt agacgcttta cttttacttt 2220 ttgacaggat atcactgtgt agctggcctg gaacatgcac ttcctccagt aacatctgcc 2280 tcatttccgt gagccattgc tataaatggc agacagaggc gtgagtctgg gagagtgcag 2340 cctaggccgt agcttgagcc ttaatacccc ttcttttacc ttattaggaa gttcgagaac 2400 tccaggagca gctggcccag cagcaggtcc acgtggagat ggatgtggcc aagccagacc 2460 tcacagcggc tctgagagag attcgcactc agtacgaggc agtggccacc agtaacatgc 2520 aagaaacaga agagtggtat cggtccaagg tacctgtggg gcagcttgcc tcttccagga 2580 agtcttcctg ctccctcctg aagaataaag gaaggaagga agctagggaa gggtaaccta 2640 tctaggtgtc cagtcttgcc cagagagagg agacttgccc agaaccacgc aagcattatt 2700 aacttcccaa actactcttg gttactgcta ttataagtaa tatcagtaat agttaataat 2760 agctaaaggc tcaccagttt gctcagtgtc acacagctcc taagtgacaa agggatagcg 2820 tttcctatcc taacccatat catcagagag cctgcttttt gatgcctctg ccagcctatg 2880 ccctgcatta ccgaacatct actgtgtgtc acgttctttt tttctttttt tggagctggg 2940 gaccgaaccc agggccttgc acttgctaag caagcgctct accactgagc taagtcccaa 3000 ccccgtgtgc cacgttcttt agtccagatg aggccagcgt ttatacagac tgtaggaggc 3060 agggacgatg ggggatggac taggggtggg acttgtgcta gacaggcagc ctagagactc 3120 aggagaagag ggaagggcat ctcaggtgga gaacaggggc tgtgaggagg cctgagacag 3180 aaaagcacag agaccagaaa tgtcgctgcc cagggcgcca agcccagtgt gagttaatga 3240 gcctcacatg cagggacagt ctgggaggat gaaggcctaa gaagttcttg ggctacagtt 3300 gttcctccta agggagccct gaggttcctg gtttttcagt gatcatgtat atctcccagc 3360 tcaggcagct ctctgggctg tgccttgcgc actttccctc aatttttggt cctctctggg 3420 tttagtttgc agacctcaca gacgttgctt cccgcaacgc agagctgctc cgccaggcca 3480 agcacgaggc taatgactat cgccgccaac tgcaggcctt gacctgcgac cttgagtcct 3540 tgcgcggcac ggtgagcacg gatcactggt tggatggggc agcaagaggt tgctggaggg 3600 ggacagggat ggagggaccg gggtggcata ggcgaacacg gaagagatca ggggttgcag 3660 cctcgaagtt tctcaaaatc tcccggccaa gcgatcagtt ccatcccgca gaacgagtcc 3720 ttggagaggc aaatgcgcga acaggaggag cgccacgcgc gggagtcggc gagttaccag 3780 gaggcactcg ctcggctgga ggaggagggc caaagcctca aggaggagat ggcccgccac 3840 ctgcaggagt accaggatct actcaacgtt aagctagccc tggacatcga gatcgccacc 3900 tacaggaaat tgctggaggg cgaagaaaac cggtgagctc tacctcagcc cgagaattcc 3960 tcttgttcct ttacagtact ccggagttag ccttgtatcc ttcccaccca gagtgttcag 4020 cctacgaaca atagatattc aagaaatgtc tgaagaatag atgggtccaa ctactggtaa 4080 ttctcgggac tgaggacttg agttattccc cagattaagt ggcttaccac aagcatggcc 4140 aacaaaaatg gaatttccca tggtctgaag ggcttgttct gcctctatga agttattgtg 4200 acttaaaaaa aaaattccat ctccaaagcc tccttcctgg aagtattctg tgggggcaga 4260 taacacagtc ctgcagacac ttccggctcc tggagagtgt tggctgctgg gttgaaagtt 4320 ggtggtggag gctggagaga tagttagggc tttgggttcc aacatccatc ttatggcttg 4380 caatggcttg tagtaacact ccagttccag gggaacccac atctgccatt gtatgtgtgt 4440 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt atgtatgcat gcatgtatgt 4500 atgtatgtat gtatgtatgt atcagtcatt atgttgctgc tttagtccca gcacttgaga 4560 ggaagaggcc agcagatctc tgtgagtttg aggctaacct ggtatatata ttgaatcagc 4620 gccatataga gaggctttgt aagaaaaagt tggcgaagag aaagagagga gaggagacag 4680 agaggagcag aaaggagaga gagcaggaga gagacagaca gacagacaga cacacacacg 4740 catgcacaca ggacagagag aaaatgtgtg tgtgtgtgtg aaagagagag agagtgtgtg 4800 tgtgtgtgtg tatgtgagtg tgtgtgtgta tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 4860 tgtgtgtgtg tcctggtaaa gggctttgag attttcatgc tggccctgga ccagccctgg 4920 cctggggatg tcctagggca gcctcctcat ccctatccct ccttttccag catcaccatt 4980 cctgtacaga ctttctccaa cctccagatc cgaggtcagt atagtggggc cttgtgggaa 5040 agggtaatga atcccctccc tcctccccca ggcctgtgaa ctgctctcat ggactgggag 5100 gcaaagagac ccacgctctc cttgagagct tcgtgaccct ccacggagac cctagatgga 5160 gtgcagaatc ttagggctta tagctggtcc ctgcctccct ctgccctgtg tccccatccc 5220 ctggaaaacc caacaggtct caactgatgc caggtctgca agccatgaac aagctccgcc 5280 ttctaagtgt tttgtgttct tgtttttttt tttttttttt ctcctctctc tcttaactgc 5340 ttgtcactac agggggcaaa agcaccaaag aaggggaagg ccacaaagtc acaagacatc 5400 tcaaaaggct cacaatacaa gttataccaa tacaggctct cgccagattg taaacggagt 5460 cctgccagca ctccgttaga cacgtgcact tcagttgtgt acttttcccg gcttgccttc 5520 ttcctcccag gcctcttcta agggactgta ccgtgtcctt tgtccagaac cttcagtacc 5580 cacgctaggt cctggctctg tgtaattagg ttatactgat agagctagcc tacgttaaag 5640 gttaggtcat gctaacagag ctagcctatg ctaacagagc tagcctatgc tcaaggtgag 5700 gtcatgctaa cagagctagc ctgtgttaaa ggcagcatgt ctctgggagc tcaaaggaga 5760 tgctctgacc cctctgagca aatgcctctg cctcaccact tttgtgtcgc taggtggtcc 5820 ccttggattg cagtgcctgt gggcaggctc tgtgtttgat tcatgtgtcc ccagagttct 5880 attgcttcat tcagtgctga ctcagcccag aggttccccc tggatggctg ctcttgtagt 5940 gaataaactc taggttccct gctcttcgtt ttacatatta gtgagtggcg acacgtatat 6000 tttcctctct aagggggttg gatcaaatta cgttttactg gtatgtggcc cttaaatgat 6060 tcttagatat gggggaagat tccatttagg aacattggcc gtgcatattg cagggtgtaa 6120 ttgagcttta gaaagttaac catgctggcc aggcaaaggt gccaaatacc aggcagtcaa 6180 aggtcactag agccagataa gtgacaggga aataagaatg gggaaaggtc cgtgcgtatt 6240 aaccactaat ctgcaccacg tggtatacgt cagtgagttg acactcgcat gtccaggagt 6300 agtcactatg agcccatttt gtaagagagg aaactgaggc tccttccttg gtgcagccct 6360 tggctgtctc tcagccaggt gccagaacag aaggcagagc gggctacagg acagcagcag 6420 ggtgtgccca gcgctggggg acagatggga gtctccagtc acttcggaaa agtcatgctt 6480 ttcgctgcct agctagttag ccccgtccgt attcacgatc ctgggtgcat gatggggaac 6540 ttgggtgcag ggcaggatgt ctagtgtccc aagagcccac ggagacctcc tctcctcgta 6600 cctgcagaaa ccagcctgga caccaaatct gtgtcagaag gccacctcaa gaggaacatc 6660 gtggtaaaga cggtggagat gcgggatggc gaggtgagga aacgttccag tggccccggg 6720 gagttcttga ggctgtactg gagaaagcct ggtactagct cactgaaggt ccttattaac 6780 taccaggaga gctctggtag aagggatggg ccttgaatgt aattctcctt ttccagctct 6840 ttcatgggtg gcgtgctaag ggcaacctgt atatggaggc caccatcttt ctggaacact 6900 tggtggaggg aggctttcaa attttactca gaagaacttt ccatagggga attggaagag 6960 gggctagaca tagaatggcc gtcttcattt gttgggatgt tagcagcaat ggttgccctt 7020 acgtcatagt cctctacaag tggcatttca tatctcaatg tgtgatgcaa tcaagccgat 7080 agtcctggaa gacacatcca gggcagtcag gggctcgcct cggggatgct cagcctagca 7140 aactggatct gggattctgg agcccctaac tctgtgctgt gactcttccc tctgcctgca 7200 ggtcattaag gagtcgaagc aggagcacaa ggatgtgatg tgaggtgtgc ccagctggcg 7260 gcccttgcca tacagtgtga gggcctaaag ctccctcctc agatagtctt gtttgctagg 7320 cccaattccc atccacacca gtgctcccct tccttctgtt tttatgccca cggctcggtc 7380 agtgcggagt ctcatggacg gcacagacca ccctgcatct ccaactaaca ggatactcac 7440 cccaaagggg caatcaggag gggaggaccc ccctcccccc agctgggtta gaactggaag 7500 aaagaggaaa gacaggggca gggagactta acaaatccct tccttcatcc ttgttgttat 7560 ggaaaccgtt gccagagctg gaggtctctg ggaactggac tttgagtttt cataggctgc 7620 tggagcaaga caaacattca gacagaaagg aaaagttccc gaggcaaaga atctctagcc 7680 agaggcctag gcatctggaa gaaccctgac gatgtaggag tgggtagggc agacttgcta 7740 cctggaatgg ccactaaggc agtcctgaag ggcccccctc cggagggatg accctcgtgt 7800 atcggcccca ctgagcagcc ctgcaggttg atgccccacg agcctgtgaa aacttggttc 7860 ttggcatgtg gcaggctcta tagcataagt ggagagggaa ggtgtactgg agggtataga 7920 ggagggctct ctggccccta agtatggatg cggagagggg ggagcccagg aaggctaccc 7980 cgctcaggct gcaggggtgc catggcggag gaaccggtgg agataacttg gacaatggag 8040 ttggaagttg taggcaacta gttacacttg gctctgaatc cttggaatca aggaaatgac 8100 ctgttctctc aaagacactg aaacaggaga gagggacttc catccactgg gcagggtaca 8160 ggcgcgtctc agttgtgaag gtctattcct ggttgctcag tccccaactg cgcatcaccc 8220 tgggcttctc aacctggaag agtccacaac catccttctg aggccctcca tccccacaac 8280 cactagctgt tgttctccaa gccaagggcc ccattccctt tcttatgcat gtacggagta 8340 tcgcctagac tttaagcgtc catcctgttt gaaagtttgg gaaactgaca cacgttgtgt 8400 tcaagcagcc tggtgtggag tgccttcgta ttagtgtacc ctctcggaag ctggttggtg 8460 ggcaggtgag gaagaaatgg agctgaaagt gtcccctcag ttgtcctttc ctccccctct 8520 aaggtccctc acttttccca ggacatcgta cactcccccc cttgtcacct ctgctaacct 8580 tcagagcagt actgtcacct ttactcactg ggcagaaata aagacagtgt cagaggcttc 8640 cacagagkat ctgatttgtt cagaaggtgg ggtgaggaca gacaagaccc aaccttgctc 8700 gttatcacca ccgaatgtct agcgagccta aacttccaca acaaaccctc cacagcagca 8760 gctggcgtta aatccgcctc caacgacgag gtgccct 8797 30 1453 DNA Rattus norvegicus 30 gacggtatcg atcgaacccc ttcgataaca gcggaatccc ccgtctacct ctctccttgg 60 tcctggaata gcgctaccga tcacaaagta gccctaagac ataataaacc ctcaactgct 120 ccgtagtttt tcttatgaaa gccaagtaaa ggggacgtaa gcaaaaaaat attttttttt 180 gcgtgaagga ttccaaaaat aaaattctct ggggattgag aagaaagaaa aaaaggaaaa 240 tgccagctga tataatggag aaaaattcct cgtccccggt ggctgctacc ccagccagtg 300 tcaacacgac accggacaaa ccaaagacag cctctgagca cagaaagtca tcaaagccta 360 tcatggagaa gaggcgccgg gcaagaataa atgaaagttt gagccaactg aaaacactga 420 ttttggatgc acttaagaaa gatagctccc ggcattccaa gctggagaag gcagacattc 480 tggaaatgac agtgaagcac ctccggaacc tgcagcgggc gcagatgacc gccgctctca 540 gcacagaccc gagcgtgttg gggaagtacc gcgccggctt cagcgagtgc atgaacgagg 600 tgacccgctt cctgtccacg tgtgagggcg ttaacaccga ggtgcgcact cggctgctgg 660 gccacctggc caactgcatg acccagatca acgccatgac ctaccccggg caggcgcacc 720 ccgccttgca ggcgccgccg ccgccgcccc cgtcaggacc tggcggtccc cagcacgcgc 780 cattcgcgcc gccgccgccg cttgtgccca tccccggggg cgcggcgccc cctcccggca 840 gcgcaccctg caagttgggc agccaggctg gagaggctgc caaggttttt ggcggcttcc 900 aagtggtgcc ggctcctgac ggccaatttg ctttcctcat ccccaatggg gccttcgccc 960 acagcggccc ggtcatcccg gtctacacca gcaacagcgg gacctcggtg ggtcctaacg 1020 cagtgtcgcc ttccagcggc tcctcgctca ctgcggactc catgtggagg ccgtggcgga 1080 actgaggggc tcaggccact gctaatcata aactccctag cccacctctc tcttctgacg 1140 gacactaaat acgaacttgg actttaggag agacttttat aagtcggtgg ttactttgtt 1200 gcttttttaa attctaaaaa gttacttttt gtagagagct gtattaagtg actgaccatg 1260 cactgcattt gtatatattt tatatgttca tattggattg cgcctttgta ttataaaagt 1320 tgagatgaca tttcgttttt tacacgagat ttcttttttt atgtgatgcc aaagatgttt 1380 gaaaaatgct cttaaaatat cttcctttgg ggaagtttat ttgagaaaat ataataaaag 1440 agtgaaggct ttt 1453 31 4831 DNA Rattus norvegicus 31 agcccgctcc ccgagcagcc ttagccatcc ctaactaatc caactgcttc ccagtctgcg 60 ggtgctgggc tcggtgctga gaagaggctc tggctcttgg ctattgtcag gtcaaacact 120 aagactgtgt ccatgttaga actcatagaa gttaatggaa cccctggcag tcagctctcc 180 actccgcgct ccggcaagtc accaagtcca tcgcccacca gcccaggaag cctgcggaag 240 cagagggacc tgtaccgccc tctctcgtcg gatgatttgg actcagtagg agactcagtg 300 taaaagacac agtggagcga tgtgtatggt ttgtgatggg atcaaggttt cccatttcct 360 aagcaagcat ctaacagcaa aatacacaaa agagttttgc ataagcaatt aaattataca 420 aaagcaagag tgcagacatt ttctactcgg cggtattctt tgaaagaaaa aaaaactttt 480 ttaaagcatc tcctccattt gatgtaaccg aggagagttt gaatacaata tttgttcatc 540 actgaggatt ttatgactaa ggtctggttc cagataaatt tcatatatga ctccaatatt 600 aagtacaaga atcaccctaa cgatccagca ttatggagtc accatttcta taaatgagcc 660 tgctttttaa tcagtaagta gatatataaa tacaattttt ctgtacttag tatcttttac 720 ctttttagtt taaatttttt taaaaaaatc agacagctaa ttagttattt cattgcaaaa 780 taacataaaa gaatctcttc aggatttttt atttgagtga atacgtctga aggggagtgg 840 aggaactccg taagagacgt ctcccagaga ccaacattag gcagcgagaa cgtttctggt 900 tttttttttt tttttttttt tgtttttgtt ttttgccaca tggattttct tttcccagat 960 ttttataaaa atagtttcaa ctacatggaa acatgaataa cttggcatta ttattaattt 1020 tataatggtt ccctacagca acaacagtat atatatcttt ttcaagtgtg taaagttttt 1080 tgacctaaaa caaatgggac ttgtactgtt tttcaaaaaa taaactcagt ttgagaattg 1140 gaattcttct acacagagct tgtacgttgg gcctccaggt agctaaactc cattctgatc 1200 atcctgcatc ccgttgtttg ggatatcttc acatgcaaaa ctccttttgt gtcaaaccga 1260 gtgtgtctat gattataacc cactgttaaa attaacctac taattcttct cctttagtcc 1320 tctactttca aaagcgtgag tcagctttgc ttaaatgtaa aacagttatt ttttaaatct 1380 agatgttaga aatcaatgtg tacggtgtat tgtatttaga ccatacccgt gaccgtttct 1440 ttttactagc tcaaacacta acaatttaag acttacaaag catgggtttt atattcgcca 1500 ggcttgacat atctcagtag ggtgacagat ttaaaatgta tggtgccatt tggagccttt 1560 tacttgagat ttctttcaaa tttcacttga aaacattttg aaaatgcctg attcaagatg 1620 gagaggagtt gccttgtcat acggtctcaa atgttattga tttatttctc tcagtagact 1680 gttcgactag caatttttcc catctggttt atcgttacct gtgctagcat ctcacttgac 1740 aagtgaataa aatattccaa ctgaaaaaag aaaaataaga cctccaggca acctaccagc 1800 atttgaaaag ttagccgagt gataattagt gcctcattac cctgcctgtt agagaggaga 1860 gggcacttct ttggcgtgtt atttgagaac tgcagactga tccctccatc ttgaaagcac 1920 agagaatgat catacctgtt ccagacttcg gattttaaaa ctgtttccaa tagacacgaa 1980 cagagactcc tgctgagatg aaagggccgc tttgtcttta gctgaaacag taaacatctg 2040 ggaggatgct gtgctgcctg ttggagtgtt acgtctttgt aagatgcatt gcttaatata 2100 gtccaacccg taaaagatta gtacagattt gactaacctg tcaaaaaatg caatgaaagc 2160 agtgggtcag atggtccaga acaaaacata aattccagta atttcaactc ccctatgaag 2220 tcaatcttat ggacgactac tgctaggaga aacctaataa atgttatctg ctataaaaat 2280 cgtggctttg tttagacaca gcatagagta gacttggact aattgccagt aggtattgtg 2340 gccaactttc cattcaagca gttccagaga gatgcaatcc catggcattt catcctggaa 2400 catacgattt ttattaaagg attgttcaag ggccatatat ctcatttact tttgaaacta 2460 aagtccctct gagattgaaa caacacaccc tgagatctct ttctatgcac ggaataattc 2520 tattatccaa aagcagagtt tttgaacatc caagcagccc accaggtcat tagtaaagac 2580 ccaatgggca attagcagtc actcaagacg tggctgcttt tagaatcctt ctaacatcat 2640 tagtcgctga actctaaaaa tgaccttctg aggagagagc tccagccaac aaaagccaca 2700 tgcttttctc tcactgataa agcctctgct gtgttgacct ttcaggcctt tgcatgtatg 2760 gttacatttt tttagacata cctagaaaga aagatgctat aaagagtccc ctgctttcag 2820 cattttatga gctaaaaccc tagtgatgca caaaatgtgt tgctctgcgt gaactccgaa 2880 gccatgtggg aaagtcattg tcatttcagc agaaggtctg tactcttgta ggtttctaaa 2940 ccggaagttt gtgctccacc tatatgtacg catgcctgca atgcacacat gaggtcacct 3000 taacatgtca agatctgatc ccgttgtctg ccaaccccag agcagctctg acacctatgc 3060 tcccctgcca agcagcgagg gcactcgccc tccccttcct tgcagccgca ccgagctgta 3120 cttttgtttc catttaaaag aactgaattc caaaagcctc gttttaaaaa agcaattggg 3180 aatgaaaata cataagaata atcctaaaag ataaatccat atatttaacc tgttgaacta 3240 ctaggttttt tttttctttt cctggtggcc atatatctat acaggtgtgt aggtgtgcct 3300 acataaatat ctgtgtttgt atatagaaca tgaagaataa aggatgatat ttgtgatatg 3360 gtcttataag ttactggctc ctgtagaatt aactgtgaag ctttatgtga taaagttgtg 3420 acgtatgtat tagttggcac atgtttaagc tcaaaacaag ttggctaaat tatcatgaag 3480 cagtaccaac tgttcacggg tacgaattta tccaagaccg attggaatcc agacaaacaa 3540 aaaaattgaa gtccaagcaa atccaatctt acagtgattg ataaatgtaa atactcattg 3600 atgttttcac tgtaaattct tgatgctcag ccagtaatct gacagaatgc agtgacactt 3660 tcctgttgtg tgtgtactga tgactacagc gattatcaat gggaaattca ccaaacaaac 3720 tgatttctag acttgtggac agttgagtgt tgtgctgaaa taaaagtaga aaaggtcaaa 3780 gtaaattcgg tttgaagcct tgttatgaag ccagacaaca tgtaaggcca tgctgtgtgt 3840 cctttttata aactgttttg gagacacttg ctgtgtgtgt gcatgtgtgt ctacatgagt 3900 gtatacacac ttgaaaatgc ttattgactc aagttcactg tctgtataca tctgtgccta 3960 tctgcattgg tctggccagc gcctctaact gtgcatggta gtgacagccc tgtgtgattg 4020 gatgaaccca cagaaaaata gtgtggaaat cagagctgcg tcaaactcat tataacaatt 4080 gcctgatata gcatattaaa ttgtcagctg aaaaacgttt gtaatgttgt gcctgtaaca 4140 tgtgtaggga aaactggaaa atctgaggcc ctcactataa ttttaagggt gcatgacatg 4200 tggtgtcttt aaagatgtat cttattgtaa taaaaggtga tggataaatt ctgaggtagc 4260 gtaagtgaag catgaaacgc taggtgtttg tattatatat ggccacacaa tatttttgaa 4320 ggcagggtag taaatgtttg aaggttaata ggcaaagaat atggaagtta attttactgt 4380 gatgtgtttt tgaggtgttt ttacacattg ccagacaagg aacttagttc ctgtttcccc 4440 cctgtggact ctttgaaaca ctatgaatgt tacaaagttg gattttcttt ataaaggtcg 4500 caaaccattt gcagttggag ttgcttggat gtatagtctc ctctttgggg aaaaaaacca 4560 ctttggtatt aaaatggttt aagcactaat acattgggaa atacgtcgtc tgaggccatc 4620 tgcagacatc tgtagagatg agaactaatg aaatgtgtta ctgaactacc cagttttgat 4680 agcttccatg cctctctgct gtagtctcct cgaaaatgct gtaccttgtg tttcacatgc 4740 aaccatgtta ctctctcccg aacttacctt tgtgacactt agcagcttag ccaaaccacg 4800 taataaacag gaagaactgc aaaaaaaaaa a 4831 32 666 DNA Rattus sp. misc_feature (1)..(666) where n may be a or g or c or t/u, unknown, or other 32 atgacagtgg tctttattgt ttaggtgaca ctgtttcatg atagacagtg aagacagagc 60 atgttaaatg ttggtatatc acattagtta attctgtgtg cagaggatct gtgtgcactg 120 tatcagctat atacagatta agaacataca tgacaaagta caaaagcaca agaagaggtt 180 aaatagggaa ttatttctgg gttataatca cctggtaaca tacaccttct gtcacacaac 240 tgtgtcactg ctaaaaacag taattagcat acatggtgat gatctctatt taatctagca 300 cagccctggt gtgaatacat aagggcatga taatatttca caggtacagg ctaagttctg 360 gggcatgaac tcttttgagt aattcataaa acacacctga ggcatatttt ttgtttttga 420 aaaaaaaaag caacgttctg acatagtcca agctttactg tgcaagggag acatttaata 480 aaaacatact tttgctgccc tgacaggagg tggatactct gatgaagatg gtgaccagca 540 aaagaaatcc ctctgtagca gaataaggca ggctttccaa ctaagccagt aatctattag 600 cttccgaata tgacaatngn tactctttcc tgacttgaat cctccttcct ttcactgggt 660 tctaaa 666 33 602 DNA Rattus norvegicus 33 gctgcacgca gtgcccacct atgctcgcca tgatgctcaa cactacgctc tctgcttgct 60 tcctgagcct gctggccctc acctctgcct gctacttcca gaactgccca agaggaggca 120 agagggccac atccgacatg gagctgagac agtgtctccc ctgcggccct ggcggcaaag 180 ggcgctgctt cgggccgagc atctgctgcg cggacgagct gggctgcttc ctgggcaccg 240 ccgaggcgct gcgctgccag gaggagaact acctgccctc gccctgccag tctggccaga 300 agccttgcgg aagcggaggc cgctgcgctg ccgcgggcat ctgctgcagc gatgagagct 360 gcgtggccga gcccgagtgt cgagagggtt ttttccgcct cacccgcgct cgggagcaga 420 gcaacgccac gcagctggac gggccagccc gggagctgct gcttaggctg gtacagctgg 480 ctgggacaca agagtccgtg gattctgcca agccccgggt ctactgagcc atcgcccccc 540 acgcctcccc cctacagcat ggaaaataaa cttttaaaaa atgcaaaaaa aaaaaaaaaa 600 aa 602 34 3101 DNA Rattus norvegicus 34 tgtctacacc ataattcctt tgtctttgag ccagctcaca aatgtcactg tggttctgag 60 tgtgggggtc ttggtgcagt ccctcccctc ccagtccctt ccgtcgagga gcatggtgct 120 agtgctgcca cagcctggag acgcacacaa ccccccaaaa tctctccaga cgaccgtccc 180 acgatcacag gacagaaccc tccaaatcga aacggaggaa acggacagcc attgaacatg 240 gacgaaggaa tccctcattt gcaagagaga cagttactgg aacataggga ttttatagga 300 ctggactatt cctctttgta tatgtgtaaa cccaaaagga gcttgaagcg agacgacacc 360 aaggatacct acaaattacc gcacagatta atagaaaaga agagacgaga ccgaattaat 420 gaatgtattg ctcagctgaa agacttactg cctgaacatc tgaaattgac aacactgggg 480 catctggaga aagcagtagt cttggaatta actttgaagc acttaaaagc tttaacagcc 540 ttaacggagc agcagcatca gaagataatt gctttacaga atggggagcg ctctctgaaa 600 tcgccggtcc aggccgactt ggatgcgttc cactcggggt ttcaaacctg cgccaaagaa 660 gtcttgcaat acctcgcgcg ctttgagagc tggacgccca gggaaccgcg ctgcgcacag 720 ctcgtcagcc acctgcacgc cgtggctacc cagcttctga cgccacaggt gaccccaggc 780 aggggccctg ggcgcgcgcc ctgcagcgct ggggctgcag ccgcctccgg ttccgagcgc 840 gtcgcccgct gcgtgccggt catccagcgg actcagcccg gcacggagcc cgagcacgac 900 acggacaccg acagcggcta tggaggcgag gcggagcagg gccgcgccgc cgtcaagcag 960 gagccacccg gggacccgtc gctgcgccca agaggctgaa gctggaggcg cgcggcgcgc 1020 tcctgggccc ggagcccgcg ctgctcggct ctctcgtggc gttgggcggg ggtgcgccct 1080 tcgcgcagcc cgccgccgcg cccttctgcc tgcccttcta cctgctgtcg ccgtccgccg 1140 ccgcctacgt acagccctgg ctagacaaga gtggcctgga caagtatctg taccccgcgg 1200 cggccgcgcc cttcccgctg ctgtatcccg gcatccccgc agcagccgcc gctgccgccg 1260 ccgccgcttt cccttgcttg tcgtccgtgc tatcgccacc cccggagaag gcaggttcgg 1320 ccgctggtgc cccattcctg gcgcacgagg tggcgccccc ggggtcgctg cgcccccagc 1380 acgcgcatag ccgcacccac ctgccgcacg ccgtgaaccc agagagctct caggaagatg 1440 ccacgcagcc ggccaaggac gccccctgaa cccagcattc cttccagaac agggcagggg 1500 gctcctgagg agtcgccagg tttccaagtt caaacatccc ctaaggcgta ccagggagga 1560 agagtaagag atgctctgct cgacaggctt aggacaaaaa caggtgtttt gtgtatgttt 1620 ggagttcctg ttttgcccct ttctcaccct tctgccaccc caccctctac cctttgacac 1680 tcccttcccc atccctgctg tcacagagcc tccctgagaa atactggtta tcttaaatta 1740 ccctccctta catttagttc acgtcctctg tttccaaaca tagaccctgg ttcaggagtc 1800 tgttgggtgg gagagccaca cggaaccagt tagagtgcct ggtatcaggg ctccttgacc 1860 caggcctgga acagtagctg tgtcccctgt ctgtcccctt aggaggtgac ccataactga 1920 gggtctctga aagttacatt gacgtgtcag tattttgtat tcttcagctt tttggaaggt 1980 acctcttttt caaagaagtg aggatgccat tgccctgttg tgaggtggct ggagtggtgt 2040 ctttatacct tgcacctgtt gggagaaact gagagttggg gccatcttca ggcactgtgt 2100 cagtgtggga gctggaagag ggagtttgga gcccgtggcg cctttctcgc actttattga 2160 caaattgacc tcaacccctt tgtcccatgt ctcaactcac agatatatgt cataggttat 2220 atatttgtgt ttctgatccc tcgttatttt atccatcatg gtcccaaatt tttgtaatgt 2280 tactggggtt tggggtgggg tggggtgtta aagtgctctg ggctggaaaa agacaagccc 2340 aaacctattg attgtcgaat tcttagatga cagaagtgga gagaggggct tgtggtccct 2400 tgtgatggga agtgctgtga acatgtagaa ggccctgcca gcctcgctct ctcaagtctg 2460 tatgtatttt tcgggagacc aaaccagaca ccagataatc aggaagaaag ctttttaaaa 2520 taaggcaaaa accgagacct tgtctagata tttttagttt gttgccaagg tagcactgag 2580 aaatctcact tgaatgttac ataaggagtg attcacaata gtctagagtg aagaaagtta 2640 tctgggtctg tgagtgttcg ggtccgtttg ctgctgctgt tgctactgtt tgcctcaaac 2700 gctgtgttta aacaacgtta aacttcttag cctaccaagg cggccgtatg tacatagctg 2760 ttaatacccc caactaatgt ctgacatgct atttttgtag ggagaagata cctgctagtg 2820 atattttgag ttaaaatatc ttttggggcg gacttggtga aatgtttgca ctttggtcac 2880 aatgcttcta ctgcttggtg caacgttacg ctgtcttaaa ttattaaaca aataaaaaat 2940 actatctgca agaaaaacca gctggtttag acaagtttag tatgtaaaga taagctagaa 3000 actatcttta tattctagta ttttcagcac tccatattac ctaaatattg ccacactatt 3060 ttgtgattta aaagttctta ctaaggaata aaatctttat a 3101 35 343 DNA Rattus norvegicus misc_feature (1)..(343) where n may be a or g or c or t/u, unknown, or other 35 tttttttttt tttttctggc tccttttctg gctctgggag gagctttgct caaaagggac 60 accacctatc cttagcatgc ttctcttgag gtacagtatg cacaaccaat aggagaccca 120 agtcaataat atataaaagg tgcttaaaaa aaaaaaagca aacagtaaca cacacgaaga 180 aatcaaccaa aaattggtgg acatctgttt tttattataa tatagattct gaatatttta 240 aggaataaag agttattgtt ttattacatt gccctctaat ctgtatggaa taaaattatg 300 actttgtnaa acaaaaaaaa aaaaaaaaaa aaacctcgtg ccg 343 36 619 DNA Rattus norvegicus 36 gcggccatgg agcccgacaa cagtccacgg aagatccagt ttacggttcc gctgctggag 60 cctcacctgg acccggaggc agccgagcag attcggaggc gccgccccac ccctgccaca 120 cttgtgctga ccagcgacca gtcatcccca gaagtagatg aagaccggat ccccaaccca 180 cttctcaagt ccacactgtc aatgtctcca cggcaacgga agaagatgac aaggactaca 240 cccaccatga aagagctcca gacaatggtt gaacatcacc tagggcaaca gaaacaaggg 300 gaagaacctg agggagccac tgagagcaca gggaaccagg agtcctgccc acctgggatc 360 ccagacacag gctcagcgtc aaggccagat acctcgggga cagcacaaaa gcctgcagaa 420 tccaaaccca agactcagga gcagcgtggt gtggagccca gcacagagga cctttcagcc 480 cacatgctac cactggattc ccaaggagcc agcttggtct gacagaagtt gacatccggg 540 gatcgccagt gagtgtggaa gttcatggac actggatgtt tcttaatctc ttgtttttaa 600 acgtgataaa tttggtgtt 619 37 4614 DNA Rattus norvegicus 37 aattccgtgg gcggcgggcg gcggagaagc tccgagcggg gccgggcggc ggcagattgg 60 agtcgcgaca cagccgccgt ccgcttcggg ctccacgtag aaggaaccat gctggccgtc 120 ggctgcaccc tgctggtcgc cctgctggcc gcgcccgcag tcgcgctggt ccttgggagc 180 tgccgcgcgc tggaggtggc aaatggtacg gtgacgagcc tgccaggggc cactgttacc 240 ctgatctgcc ctgggaagga agcagcaggc aatgctacca ttcactgggt gtactcaggc 300 tcacagagca gagaatggac taccacggga aacacactgg ttctgagggc cgtgcaggtc 360 aatgacactg ggcactattt gtgcttcctg gatgatcatc tggttgggac tgtgcccttg 420 ctggtggatg ttcccccaga ggagcccaag ctctcctgct tccggaagaa cccccttgta 480 aatgcctttt gtgagtggca tccaagcagc actccctctc caaccacgaa ggctgtgatg 540 tttgcaaaga aaatcaacac caccaatggg aagagtgact tccaggtgcc ttgccagtat 600 tctcagcagc tgaaaagctt ctcctgcgag gtggagatcc tggagggtga caaagtgtac 660 cacatagtgt cactgtgcgt tgcaaacagt gtcggaagca ggtccagcca caatgtagta 720 tttcagagtt taaaaatggt gcagccggat ccacctgcca accttgtggt atcagccata 780 cctggaagcc tcgttggctc aaagtcagtt ggcaagaccc tgagtcctgg gacccaagtt 840 actacttgtt gcaattcgag cttcgatacc gacctgtatg gtcaaagaac gttcacggtg 900 tggccgctcc aggtggccca gcatcaatgt gtcatccatg atgccttgcg aggagtaaag 960 catgtggtgc aggtccgagg gaaggaggag tttgacattg gccagtggag caaatggtcc 1020 ccggaggtca caggcactcc ttggctagca gagcccagga ccactccggc agggatcccg 1080 gggaacccca cacaggtctc tgttgaagac tatgacaacc acgaggatca gtacggaagt 1140 tctacagaag caacgagtgt cctcgcccca gtgcaaggat cctcgcctat acccctgccc 1200 acattcctgg tagctggagg aagcctggcg tttggattgc ttctctgtgt cttcatcatc 1260 ttgagactca agaagaaatg gaagtcacag gctgagaagg aaagcaagac gacttctccc 1320 ccaccgtatc ccttgggacc gctgaagccg accttcctcc tggttcctct cctcacccca 1380 tcagggtccc ataacagctc tgggactgac aacaccggaa gccacagctg cctgggtgtc 1440 agggacccac agtgccctaa tgacaacagc aacagagact acttattccc cagataattg 1500 tctggagggt acctggcagc tggcacgcaa gtttctcact gccggccccg tccaccaggg 1560 ctgggggcgg ggtgggcggg gctgcagctt cacgatccca caggagcctt gcaaaggttc 1620 tgagtgggag aagactggtg tgctgcacgg gcttcgaaag aaggggctgt gaggagcacg 1680 agccatcatg aagagagcct gtgatgactc tgaatagaga cgcccgccca tcagctacac 1740 acctgatggt ggctctcaag ctatcctctc aggaagcctc tgggaggggc gacaaaggct 1800 gccccagttg cctagctctg gctcactggc ccaagctgcc ttttagcttg aactcctaaa 1860 atccaagcac cttggccatt ctcttcctag gccaccgagg ccgcggggaa gcttggttct 1920 actttccttc tcaacacctg gagaagcagc tgcccggtgg tggtgactaa cgtatcaggg 1980 cctgatggct tatgaggaat gacaattaat tcctcataag cagtttttaa atgtgaatag 2040 taatcctagg cactgctgac ttgaggtttt attttcttcg atctcaggac ttcaggagag 2100 aagcagagca gaagtagaga gaggatgggt gtccattgtc cgtgtggtac ttgaggggga 2160 tacagcctgg aaaacacgtt tcctgtcccc ctactctccc agaagaggta gggggtggcg 2220 cctcttccag ggcagagagt ataactactt tacctggcct tgcccatact ggtttcaact 2280 ggacttgagc tactaggaaa aatgacattc atgcaaaaag aaaactttaa ctagcaagaa 2340 tgcacttcca ctttggtttc tagaggactg ttgctcctct tgagacgctg gaagaggccg 2400 ctcactgtac cctggtgtat gagccctcac cccccacccc agggtaagtg cagtaacttt 2460 agtctaaaca ccgagtcagg taaaaatcga ggaaaaaaca accctgtttc ctgtaacaga 2520 aaagcctttg gtttcgtttt gtattttgat tttttttttg tcttaaaaag tgtaaaaata 2580 gtctgtccat actctgcttc agggaatgac ctgtgaatac tccccaggcg tgggcaggaa 2640 gggtgtctgc ttcctgctac acctcactgc cacctcggcc ttccttgctt tacattcaac 2700 tgagttgcct cagctgcttt cccctggggc gctgaaaaag ccagtgatgt tggtggtcac 2760 cgagaagacc acagagccac agagtaatgc tgtgattgaa gcgagttacg caaccacagc 2820 accccacatt tgctgtatta tagaactatg ctaggagctt gccttttcac aaaataccac 2880 caccacgaga cgtggcagag ctcggaagct gtcaccttgt gccatctgct tgccagctcc 2940 aaggggccac tgacttaagc agttattttc tttgtgggct ttgttcattt cagggcctgt 3000 tgctgtctta gaaaaagctc tgtcggttga caaaaacatc agacaggtag tcatgtttat 3060 ttattttttt tccttctttg ctaagtcttt gggactcaag ggtagtaaaa aatgctgtga 3120 aaagggaaac attagaaaca gcgatcttcg gggaataggt gactgtgccc acgcactgtt 3180 cttcagtccc tcacgtggct ctgcccgagt gctgttccaa gccaggcaga gcaggctggc 3240 ggaagattga aatccagata gctcgttatc tctgagagct aaatagcttt gatctccaag 3300 ctgttattgc tttcactatt gtaacaggat agcctccccc cccatgtcaa aaggatgctt 3360 ttcccttttg actttttata agctaagtca gtgaagtctg tttcatctga gctccagctt 3420 cgttcagttc gcacaggtgt atgccctcag ctgcttcggg cctcagatct gtgctagttg 3480 aatggttgtc ccatccttgg gtcatcctta ccagagtttc tgcagcccac aggtctgcct 3540 tgtcaacagt accacttaac accagcattc agtgcccagg cagccagatg tggagggttt 3600 acccagagat gatttaaaca tgaccttaaa cgtgtatggt agaacgaggg gaacccatac 3660 cagctcaggt tctaaagaga tctttgattc ttctggcatt agtgaaatag ctttaaacta 3720 tttcaaggaa gaagccttgg ccacacccac gacatttggt gacaatcctt tctctccatg 3780 agccttgtct ttacaccttc tcacctggct gaaagctcac actgaatctt tcctatgtcc 3840 ctggtgtctt gggagaaagg aaactggtat gggcttcact gctggaattg gcttggagcc 3900 agcgtgtggc gcagccctgg cagggtgggc caggcttagt tatggtgtgc tggtttaagg 3960 aatgcctggc ttgcctggtt gcttgggttc tgagctgcag agtttcctag cagttcttta 4020 tggctgacct agttggggaa gattcccaca ctcaactgca ggtggaggtg gtgagaaagc 4080 tgttttcatt tggagaggca ggatcagccc aagaagcttt cagtgggaga gcctacagtg 4140 aggctgtacc tcactgtggg aggaggcagg ccagctggct caggtcctgg gactggcact 4200 ggggagggtc tgccaaaggt ccctccagcc tgtagtccta gcatagtcgg gtgccagttc 4260 caggaagttt ctatggcaac cttagtgctc attaaggaac attgtcagtt ttgtgaacat 4320 atgctcagat ggagatcttg ttttcagaga aaggactggt acagtgtgta acaagctgga 4380 gcagacagag agactttttg gcaagagatc acatccgtta agcagaatac ctcagtgcta 4440 catgtttttg tctttgagac aatgttttta aggtttttat gctctgttac ctgtaagctg 4500 atacctaaaa ctttctgcaa agtcagggtt tttcaatgcc tttttttttt ttttgccatt 4560 gtttgcttta aagtgaagat tgtaactgtt tgaaataaat aatttctaaa actg 4614 38 2952 DNA Rattus norvegicus 38 tgctgggcga ggggacacac tgactgttat aataacacta catcagcaac tcctggctcc 60 ccaacagccg gatcccaggc aggagagagt cagtggcaga tagccatttt tttttcttcc 120 ttaagaagcc aacaacttgg ttgctagttt tatttctgtt agaatttttt ttttgtgtgt 180 gtgtggatgt gtggtggtgg tggtcttttc taagtgtgga gggcaaaagg agataccatc 240 ccaggctcag tccaacccct ctccaaaaac ggcttctctg gcactccagg tagcgaggga 300 gttgggtctc caggttgtgc gaggagcaaa tgatgaccgc caaggccgta gacaaaatcc 360 cagtaactct cagtggtttt atgcaccagc tgcctgacag cctctacccg gtggaagacc 420 tcgccgcccc gtcggtgacc atcttcccca atggtgaact gggaggcccc tttgatcaga 480 tgaacggagt ggctggagat ggcatgatca acattgacat gaccggagag aagaggccct 540 tggatctccc atatccgagt agcttcgctc ccatctctgc gcctagaaac cagaccttca 600 cttacatggg caaattctcc attgaccctc agtaccctgg tgccagctgc tacccagaag 660 gcatcatcaa tattgtgagt gcgggcatct tgcaaggggt cacccctcca gcttcaacca 720 cagcctcttc cagcgtcacc tctgcctccc ccaacccact ggccacggga cccctgggtg 780 tgtgtaccat gtcccagact cagcctgaac tggaccacct ctactctcca ccaccacctc 840 ctcctcctta ttcgggctgt acaggagacc tctaccagga tccttcagca ttcttatcgc 900 cgccacccac cacttccacc tcctctctgg cctaccagcc acctccttcc tacccatccc 960 ccaagccggc tatggaccca ggtctcattc ctatgatccc agactatcct ggattttttc 1020 catctccgtg ccagagagat ccacatggcg cggctggccc tgatcgaaag ccgttcccct 1080 gccctctgga ctccctgcgg gtcccccctc cgctcacgcc actctccacc atccgtaatt 1140 ttactcttgg ggggcccagt gctggcgtca cgggaccagg ggcaagtgga ggcggtgagg 1200 gtccccgact gcctggcagt gggtctgcag cagtgactgc taccccctac aatccgcacc 1260 acctgccttt aaggcccatc ctgagacctc gaaagtaccc caacaggccc agcaagacgc 1320 cagtgcacga aaggccgtac ccctgcccag cagagggttg cgacaggagg ttctcacgct 1380 ctgatgagct gaccaggcac atccgaatcc acaccggtca caagcccttc cagtgtcgga 1440 tctgcatgcg aaacttcagc cgaagtgacc acctcaccac tcacatccgc acccacaccg 1500 gggagaagcc ctttgcctgt gactattgtg gccgtaagtt tgcccggagt gacgaaagga 1560 agcgccacac caagatccac cttcgccaga aggagaggaa gagcagtgcc ccctcgtcat 1620 ctgcatccgc ccagtcttca gcctctggtc ctgggggctc gcaggccggg ggcagcctgt 1680 gcggtaacag cgccattgga ggaccactgg cctcctgcac ctctcggacc aggaccccgt 1740 gagatgaagc tcccgctgac acaccagttt cttcaggccc cagaggccct ctatccactc 1800 gagctgcaaa cactaccgcc cttctgtgtt cttccccgtg atcccgtgaa cctgtgatcc 1860 tgggcaaagg accctaatgg agcccagctc tgtcccacct tctcacagac ggccttctga 1920 aaacttaggc cattttaaag gagttgactg tcactccaag aaatggggag ccagaagagg 1980 gctgggcgag ggcccctggc ctacagggct gtgctctaac cctgacagag agatgtttga 2040 ctatggtctg cgagcccttc cctttgaccc tcgatgccag ttgctctgag actttttcta 2100 caataggttg ggagttgttg attcctttga gcaaggacag cgaaaaagac taaattaaag 2160 caaaaccgat gtggcacttt aatggcttgg gacggacttg gggtaggggt ggggggttgt 2220 acagtgagca cagtttagcc ctggcctggc cgctgcactc tgtggcccta gaacagtgaa 2280 tggaagtttc tcgagccatc tcaaccctta agcaatatgt cctataaact caagagaacg 2340 aacggaagtg caatgtcggg gaggacaaag ccaatattgg ctcctttttt ttgagaaaca 2400 aagattattt tccagtgtat atccatttag atttttgtgt attttttctt tctggatgtg 2460 cactgtttct ccgagttctg aacctttggg aaaaaagtgt aaaacattta tgatctcttg 2520 aatcgagtca aaagttaact tatttaaagg ggggtgtaca taggatgcat gcagtggtgt 2580 tgcaagtgtc ctctgtgcct tgtgtgatgt gggcagtgtt acagggtctg catgtgtaca 2640 ggatgcctta ctatgaaaaa aaaatcactc cctgggttta agtatggctg tatatttctg 2700 cctattaata tttggaattt ttttagaaag tatatttttg tatgctctgt tttgtgactt 2760 gaaagtgtta ccttcgcagt caaatttcag atgagagtgt gcttaacgtc actgcagctg 2820 acttgtttgg ttattagctc ttaatagttg tggaaagatt aaacaatcta ttctaacaca 2880 gaaccactaa ctggagttca gatatcggac ggcttatggc aatggtgtaa aataatactt 2940 ttcaacaata aa 2952 39 2032 DNA Rattus norvegicus 39 gaattccccc ggttcttctc tctaggtccc ctatctccgc cccgggcctg aggggcgcac 60 cgaccgccac catgagttcg ttcagctacg agccgtactt ttcgacctcc tacaagcggc 120 gctacgtgga gacgccccgg gtgcacatct ccagcgtgcg cagcggctac agcacggcgc 180 gctctgcgta ctccagctac tccgcgcccg tctcctcctc tctgtcggtg cgccgcagct 240 actcatccag ctccggctct ttgatgccca gcctggagaa cctcgatctg agccaggtag 300 ccgccatcag caacgacctc aagtctatcc gcacacagga gaaggcacag ctgcaggacc 360 tcaacgatcg cttcgccagc ttcatcgagc gcgtgcacga gctggagcag cagaacaagg 420 tcctggaagc cgagctgttg gtgctgcgcc agaagcactc agagccttcc cgcttccgcg 480 ccctgtatga gcaggagatc cgtgatctgc gactggcggc cgaagacgcc actaacgaga 540 agcaggcgct gcagggcgag cgcgaggggc tggaggagac tctgcgcaac ctgcaggctc 600 gctacgagga ggaggtgctg agccgcgagg acgccgaggg ccggctgatg gaagcccgca 660 aaggcgcgga tgaggctgcg ctcgcccgcg ccgagctgga gaagcgcatc gacagcctga 720 tggacgagat agccttcctg aaaaaggtgc acgaggaaga gatcgccgag ctgcaggctc 780 agatccagta tgctcagatc tccgtggaga tggacgtgtc ctccaagccc gacctctccg 840 ccgctctcaa ggacatccgc gctcagtacg agaagctggc cgccaagaat atgcagaatg 900 ccgaagagtg gttcaagagc cgcttcacgg tgctaaccga gagcgccgcc aagaacaccg 960 acgcagtgcg cgctgccaag gacgaggtgt cggaaagccg ccgcctactt aaggctaaga 1020 ccctggagat cgaagcctgc cggggtatga acgaagctct agagaagcag ctgcaggagc 1080 tggaggacaa gcagaatgca gacatcagcg ccatgcagga cacaatcaac aaactggaga 1140 atgagctgcg aagcacgaag agcgagatgg ccaggtacct gaaggagtac caggacctcc 1200 tcaatgtcaa gatggcattg gacattgaga ttgcagctta caggaaactc ttggaaggcg 1260 aagaaaccag gctcagtttc accagcgtgg gtagcataac cagcggctac tctcagagct 1320 cgcaggtctt tggccgttct gcttacagtg gcttgcagag cagctcctac ttgatgtctg 1380 ctcgagcatt cccagcctac tataccagcc acgtccagga agagcagtca gaggtggagg 1440 agaccattga ggctacgaaa gctgaggagg ccaaggatga gcccccctct gaaggagaag 1500 cagaagagga ggagaaggag aaagaggagg gggaggaaga ggaaggtgct gaggaggaag 1560 aagctgccaa ggatgagtct gaagatgcca aagaagaaga aggtggtgag ggtgaagagg 1620 aagacaccaa agagtcagaa gaggaagaga agaaagagga gagtgctggg gaggagcaag 1680 ctgctaagaa gaaagattga gccccattcc caactatccc aggaaaaaag tctccccaaa 1740 tcaggtcaac ctcatcacca aaccaaccag ttgagttcca gatcctatac agattaagaa 1800 gtcaatatat gtataattct gagatgactt aggttggaca ttcaatgttg tgctatgact 1860 ttcctcctta tgcagagtat ctgtttgctt gcagagtggc tttctggctt gctgccaacc 1920 tgtgcatggt ccatgcttat gagttcagga tctacggcaa tgtgaatcac acagatgttt 1980 acaataataa taataaaaaa accacacata cacaacatga ataaatgaat tc 2032 40 3070 DNA Rattus norvegicus 40 cggacgcgtg ggcagccaca caccccaagg cctccaagat gagctacacg ctggactcgc 60 tgggcaaccc gtccgcctac cggcgcgtca ccgagacccc gtccagcttc agtcgtgtga 120 gcggttcccc gtccagcggc ttccgctcgc agtcctggtc ccgcggctcg cccagcaccg 180 tgtcctcctc ctacaagcgc agcgcgctcg ccccgcgcct cgcctacagc tcggctatgc 240 tcagctcggc cgagagcagc ctcgacttca gccagtcctc ttcgctgctt aacggcggct 300 ccggcggcga ctacaagctg tcccgctcaa acgagaaaga gcagctgcag gggctgaacg 360 accgtttcgc cggctacatc gagaaagtgc actacttgga acaacagaac aaggagatcg 420 aggcagagat ccacgcgctg cggcagaagc aggcctcgca cgcccagctg ggtgacgctt 480 acgaccagga gatccgagag ctgcgcgcca ccctggagat ggtgaatcac gagaaggctc 540 aagtgcagct ggactctgat cacttggagg aagacatcca ccggctcaag gagcgcttcg 600 aggaggaggc gcggctgcgg gacgacaccg aggctgccat ccgggcgctg cgcaaagaca 660 tagaggagtc gtcgatggtt aaggtggagc tggacaagaa ggtgcagtcg ctgcaggatg 720 aggtggcctt cctgcggagc aatcacgaag aggaggtggc cgacctgctg gcccagatcc 780 aggcgtcgca catcaccgta gagcgcaaag actacctgaa gacagacatc tccacggcgc 840 tgaaagagat ccgctcccag ctcgagtgtc actccgacca gaacatgcac caggccgaag 900 agtggttcaa atgccgctac gccaagctca ccgaggcggc cgagcagaac aaggaggcca 960 tccgctccgc taaagaagag atcgccgagt accggcgcca gctgcagtcc aagagcattg 1020 agctcgagtc ggtgcgaggc actaaggagt ccctggaacg gcagctcagc gacatcgagg 1080 agcgccacaa ccacgacctc agcagctacc aggacaccat ccagcagctg gaaaatgagc 1140 ttcggggaac aaagtgggaa atggctcgtc atttgcgaga ataccaggat ctccttaacg 1200 tcaagatggc tctggacatc gagatcgccg catataggaa actactggag ggtgaagaga 1260 ccagatttag cacattttca ggaagcatca ctgggcctct gtacacacac cgacagccct 1320 cagtcacaat atccagtaag attcagaaga ccaaagtcga ggcccccaag ctcaaggtcc 1380 aacacaaatt tgtggaggag atcattgagg agactaaagt ggaagatgag aagtcagaaa 1440 tggaagacgc cctcacagtc attgcagagg aattggcagc ctctgccaaa gaggagaaag 1500 aagaggcaga agaaaaggaa gaggaaccgg aagttgaaaa gtctcccgtg aagtctcctg 1560 aggctaagga agaggaggaa ggggaaaagg aggaagaaga ggaaggccaa gaggaagaag 1620 aggaggaaga tgaaggtgtc aagtcagacc aggcagaaga gggaggatct gagaaggaag 1680 gctcgagtga aaaggatgaa ggtgagcaag aagaagaagg ggaaactgag gcagaaggtg 1740 aaggagagga agcagaagct aaggaggaaa agaaaacaga gggaaaggtc gaggaaatgg 1800 ctatcaagga ggaaatcaag gtcgagaagc ccgagaaagc caagtcccct gtgccaaaat 1860 cacccgtgga agaagtaaag ccaaaaccag aagccaaagc cggaaaggat gagcagaagg 1920 aggaagagaa agttgaggag aagaaggagg tagccaagga atcacccaag gaagagaagg 1980 tggagaaaaa ggaggagaag ccaaaagatg tcccagataa aaagaaggct gagtccccag 2040 tgaaagaaaa ggccgtagag gaaatgatca ccattactaa gtcggtaaag gtgagcctgg 2100 agaaagacac caaagaggag aagcctcagc agcaggagaa ggtgaaggag aaggcagagg 2160 aggagggggg tagtgaggag gaagtgggtg acaaaagccc gcaagaatcc aagaaggaag 2220 acatagctat caatggggag gtggaaggaa aagaggagga ggagcaggaa actcaggaga 2280 agggcagtgg gcaagaggag gagaaagggg tggtcactaa tggcttagat gtgagccctg 2340 cggaggaaaa gaaaggggag gatagaagtg atgacaaagt ggtggtgacc aagaaggtag 2400 aaaaaatcac cagcgaggga ggcgatggtg ctaccaaata catcaccaaa tctgttactg 2460 tcactcaaaa ggttgaagag catgaggaga cctttgagga gaagctggtg tcaactaaaa 2520 aggtagaaaa ggtcacttca catgccatag tcaaggaagt cacccagggt gactaagatc 2580 ggagtccatt gcaaaaggtt aagccataca acaatttcaa aatgcatgtg attgacagct 2640 tcaaaacaga atgggttctc ccatgagggc tccagacatt gtattttcct ttgtgcaata 2700 tgagggaact gcatgcaagc tcagggtgcc cccctcctca gtccttgggg gaattcaaat 2760 gcatgttcat gtatgtacct agggaatttg ccagtttcct aagctgttga aagaggggca 2820 ctcgggggga tgtcttgaga tgtattacgc aaagtaccaa ctgagccaaa aataataagt 2880 gaaacagaac tctcttagcc ttaagaaagc tatatatgaa tacttatgtt tacctcactg 2940 gtgcatttaa aatggacttc agttcatggg agaaccttgc tgacctgcac agttcgcaac 3000 cttatgttga ttgatgttaa atgtcacagc agttcttgct caataaaggt catactggaa 3060 ccataaaaaa 3070 41 496 DNA Rattus sp. 41 agcaaaactg gtaaaaacaa aattgtaatc gttgaacata gcgctctggc aatcaagacg 60 tttgaaaccg tcaatcttct ggggcgaaga aagcactgtg cgacacttag aactctgatt 120 aacagacaag gtggtcacaa attttcctgg cttgaagact tccacaactt tcctgatcag 180 gtcatcatag gaggtctgac ttaggtttgt ttcaaagcta acataagaaa attctggttc 240 tggagtgatg tgaatattcc aatatgttcc atccgatttc attccattca ttgagtagcc 300 acaaggattg aacagtgtgg catcaatgac agaacctggt atcaggtcac gaattccact 360 ctcacgagtg acatcctttg cagtaacacc atctttcatg tagaactggt ccataactgc 420 tgggtcaagc tcactcatca gaatttccac ggtttgatct ggctgattga ttactcggct 480 ctctgggagc ctcgtg 496 42 458 DNA Rattus norvegicus 42 tttttttttt tttttctgaa gtaaatagag acatgtagag tcttccctcc atgtcaggct 60 gcacttcatt agccccagct cagtaatgca gggaacccta gtgacccatc ggccaagaaa 120 ctcccagaag cattaaaaaa aaagttatat tccgctgcca agtggatagt catttagctg 180 tttgtccctt gttttttatt tattccataa ttatgtttgt gctttttctt gtgtgaacag 240 tagtgaggcg tatgttttta tgtggcttta gagaaaactt cagtcttcaa agaactgttc 300 taattagttc cttctcggaa aaagttatgc gttaatttgt ttcaaaatat ttaggcattc 360 tttgaattat aaacttgtga tgcagggatt tgtgaacgag acgttcacaa gtgaagatga 420 cttcacttag catctgtgta aacagaataa gatgtgta 458 43 4757 DNA Rattus norvegicus 43 atctgtgtgc gagtgcgtgt gcgtgcgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 60 gtgtgagcgt gtgtgttttt ggatttcata ctaattttct ggagtttctg cccctgctct 120 gcgtcagccc tcacgtcact tcgccagcag tagcagaggc ggcggcggcc gccggttaga 180 gcccagtcgc tgcttcagct gctgttgctg cttctgcggc gctctgctcc ctgcgctggc 240 tacgggaggc cgggggagcc gcgccgacag tcctctgtgg ccagggccgg cactgtcctg 300 ctaccgcagt tgctccccag ccctgaggtg cgcaccgata tcgatatccg tgccggttta 360 gcggttctgc gacccaaaga gtccagggag agccaccgag tggcgcctgg cgtataggac 420 catgcagccg ccttgtggct tggagcagcg gcccgtgatg ttccagccac tgtgaaccat 480 ttggtcagcg ccaacctgct cagccccagc accgacaggc tcagcctctg gtacgctcct 540 ctcggcggga ggccatcagc accaagcagc aagagggctc agggaaggcc tcccccctcc 600 ggcgggggac gcctggctca gcgtagggac acgcactctg actgactggc actggcagct 660 cgggatgtcg ccctggccga ggtggcatgg acccgccatg gcgcggctct ggggcttatg 720 cttgctggtc ttgggcttct ggagggcttc tcttgcctgc cccatgtcct gcaaatgcag 780 caccactagg atttggtgta ccgagccttc tcctggcatc gtggcatttc cgaggttgga 840 acctaacagc attgacccag agaacatcac cgaaattctc attgcaaacc agaaaaggtt 900 agaaatcatc aatgaagatg atgtcgaagc ttacgtgggg ctgaaaaacc ttacaattgt 960 ggattccggc ttaaagtttg tggcttacaa ggcgtttctg aagaacggca acctgcggca 1020 catcaatttc actcgaaaca agctgacgag tttgtccagg agacatttcc gccaccttga 1080 cttgtctgac ctgatcctga cgggtaatcc gttcacgtgt tcctgtgaca tcatgtggct 1140 caagactctc caggagacga aatccagccc cgacactcag gatttgtatt gcctcaatga 1200 gagcagcaag aatacccctc tggcgaacct gcagattccc aattgtggtc tgccgtctgc 1260 acgtctggcc gctcctaacc tcacggtgga ggaagggaag tctgtgacca tttcctgcag 1320 cgtcgggggt gacccgctcc ccaccttgta ctgggacgtt gggaatttgg tttccaaaca 1380 catgaatgaa acaagccaca cacagggctc cttaaggata acaaacattt catcggatga 1440 cagtgggaaa caaatctctt gtgtggcaga aaacctcgtc ggagaagatc aagactctgt 1500 gaacctcact gtgcattttg caccaaccat cacatttctc gaatctccaa cctcagacca 1560 ccactggtgc atcccattca ctgtgagagg caaccccaag ccagcacttc agtggttcta 1620 caacggagcc atactgaatg aatccaagta catctgtacc aaaatacacg tcaccaatca 1680 cacggagtac cacggctgcc tccagctgga taaccccact catatgaata atggagacta 1740 caccctaatg gccaagaatg aatatgggaa ggacgagaga cagatttctg ctcacttcat 1800 gggccggcct ggagttgact atgagacaaa cccaaattac cctgaagtcc tctatgaaga 1860 ctggaccacg ccaactgaca tcggggatac tacaaacaaa agtaatgaga tcccctccac 1920 ggatgttgct gaccaaacca atcgggagca tctctcggtc tatgctgtgg tggtgattgc 1980 ctctgtggta ggattctgcc tgctggtgat gctgcttctg ctcaagttgg cgagacattc 2040 caagtttggc atgaaaggcc cagcttccgt catcagcaac gacgatgact ctgccagccc 2100 tctccaccac atctccaacg ggagcaacac tccgtcttct tcggagggcg ggcccgatgc 2160 tgtcatcatt gggatgacca agatccctgt cattgaaaac ccccagtact tcggtatcac 2220 caacagccag ctcaagccgg acacatttgt tcagcacatc aagagacaca acatcgttct 2280 gaagagggag cttggagaag gagcctttgg gaaagttttc ctagcggagt gctataacct 2340 ctgccccgag caggataaga tcctggtggc cgtgaagacg ctgaaggacg ccagcgacaa 2400 tgctcgcaag gactttcatc gcgaagccga gctgctgacc aacctccagc acgagcacat 2460 tgtcaagttc tacggtgtct gtgtggaggg cgacccactc atcatggtct ttgagtacat 2520 gaagcacggg gacctcaaca agttccttag ggcacacggg ccagatgcag tgctgatggc 2580 agagggtaac ccgcccaccg agctgacgca gtcgcagatg ctgcacatcg ctcagcaaat 2640 cgcagcaggc atggtctacc tggcatccca acacttcgtg caccgagacc tggccacccg 2700 gaactgcttg gtaggagaga acctgctggt gaaaattggg gacttcggga tgtcccggga 2760 tgtatacagc accgactact accgggttgg tggccacaca atgttgccca tccgatggat 2820 gcctccagag agcatcatgt acaggaaatt caccaccgag agtgacgtct ggagcctggg 2880 agttgtgttg tgggagatct tcacctacgg caagcagccc tggtatcagc tatcaaacaa 2940 cgaggtgata gaatgcatca cccagggcag agtccttcag cggcctcgca cgtgtcccca 3000 ggaggtgtac gagctgatgc tgggatgctg gcagcgggaa ccacacacaa ggaagaacat 3060 caagaacatc cacacactcc ttcagaactt ggcgaaggcg tcgcccgtct acctggacat 3120 cctaggctag actccctctt ctcccagacg gcccttccca aggcacccct cagacctctt 3180 aactgccgct gatgtcacca ccttgctgtc cttcgctctg acagtgttaa caagacaagg 3240 agcggctctc cggggtgagg cagtgcgcac ttccccatcc acagacagta tcgactcgct 3300 tctggctttg tcgctttctc tccctttggt ttgtttcttt cttttgccca ttctccattt 3360 atttatttat ttatttattt atttatttat ttatttattt atttatctat ctatctatct 3420 atttatttat ttatttattg gtcttcactg cttcatggtc ctcggcctct ctccttgacc 3480 gatctggctt ctgtactcct attcactgta catagacaaa ggccttaaca aacctgattt 3540 gttatatcag cagacactcc agtttgccca ccacaactaa caatgccttg ttgtattcct 3600 gcctttgatg tggatgaaaa aaagggaaaa aaaataatca aacatctgac ttaaaccgtc 3660 acttccgatg tacagacacg gggcgtttct atggattcac ttctatctat ctatttattt 3720 atttatctat ttatttattt ctcttctttg ttgttttccg gtggttttag cctgtgtatg 3780 agaagggaaa gtcatgtaca gtctgggaaa actttatctg tgggaaatgg aaaccagaag 3840 gggaaagaag ctttaccata aagcacagca ggagtgagac acagaaaagc cattggatca 3900 gccagagtcc gtcctgcata ggaaaaccca gcagccatca ggctggagga tcatgttcgg 3960 cactgacccc cgaggacctt tctgaggagg acacagaatg ttaaactctg catcatggac 4020 acagtttccg atcacagata ctggccttca atggaaaaaa aaaaaaaaaa aacccagata 4080 gttcttgtga gacctggaca gcacgtccaa catccagaca ttgtggtcgg gcacagtgac 4140 agagttgatg catttctcac gggttattct acagagcttt tgtcaagtcc aatggaagga 4200 ggtagattct tgttcagata tgatttcggg aaaaaccgag tccttgacaa agacaggaga 4260 caccctcagt tgggaggcaa gtttctctta ccttggactt tctcacacag caattctcac 4320 ccccaccccc tccactctca cctgtcttgt aactgtgcaa acaaaagtgt gcatggtctt 4380 tgtcagttga tacctttgtg cacctctgtg cagaaactgc tgtctgtccc ggctgtggta 4440 cccgatcagt ggggtagatc cacgaaaggt ctcattttag gccgctttgg gaaggtaacc 4500 agatcggtag ctggaagcac tctccagtag gtggcgaagg gtgagtgggt ctgctgaagc 4560 ctgcatatct tcacccacct caaacccacc gggctgcaca ggggacaggc acaggccacc 4620 cctgagggac agggaagctc tcttgggata ccacctgagt ttacattcag tgtgctcagg 4680 tcaagtctct cgctcggggc tctgtttcgg ggagaatggt ttcattccaa cgcactcatt 4740 atcaggattc tgttttc 4757 44 861 DNA Rattus norvegicus 44 aaggcgcgga tgagctcgct cgcccgcgcc gagctggaga agcgcatcga cagcctgatg 60 gacgagatag ccttcctgaa aaaggtgcac gaggaagaga tcgccgagct gcaggctcag 120 atccagtatg ctcagatctc cgtggagatg gacgtgtcct ccaagcccga cctctccgcc 180 gctctcaagg acatccgcgc tcagtacgag aagctggccg ccaagaatat gcagaatgcc 240 gaagagtggt tcaagagccg cttcacggtg ctaaccgaga gcgccgccaa gaacaccgac 300 gcagtgcgcg ctgccaagga cgaggtgtcg gaaagccgcc gcctactcaa ggctaagacc 360 ctagagatcg aagcctgccg gggtatgaac gaagctctag agaagcagct gcaggagctg 420 gaggacaagc agaatgcaga catcagcgcc atgcaggaca caatcaacaa actggagaat 480 gagctgcgaa gcacgaagag cgagatggct aggtacctga aggagtacca ggacctcctc 540 aatgtcaaga tggcattgga cattgagatt gcagcttaca ggaaactctt ggaaggcgaa 600 gaaaccaagc tcagtttcac cagcgtgggt agcataacca gcggctactc tcagagctcg 660 caggtctttg gccgttctgc ttacagtggc ttgcagagca gctcctactt gatgtctgct 720 cgagcattcc cagcctacta taccagccac gtccaggagg agcagtcaga ggtggaggag 780 accattgagg ctacgaaagc tgaggaggcc aaggatgagc ccccctctga aggagaagaa 840 gaagaggaga agaaggatga a 861 45 5865 DNA Rattus norvegicus misc_feature (1)..(5865) where n may be a or g or c or t/u, unknown, or other 45 ctcgtgagaa cgaatcgatc cttcccagcc ttctctgcct gctctccacc tcctctctgc 60 tccgagtctt aggagaacga acattcaaag gacagattcc aatgtggtgt gctgtgcaca 120 tcgcgagcgg ctggggtttg cacttcgaga tttctttata attttttttt ttaatgtaag 180 ggagacagtg gaattgctac ccgtagaatt tttattcaag tgcacgtcgc gttgggttgc 240 acgctccacc cccagggacc tggtgtggtg aaatttgaac ccaccgcctt agcccaaagg 300 ccgagtaacc tggctgcttg agtgtcgtgg aagacgtgag cgaaatgatc agcgaactca 360 ttttttatca gactcgctga agctggcttt tgcgtttttc tacacgtaca ctaattttat 420 ggaatagtta aagtgctata ttctccgcgc aaccttttca aattccaaat gtttgaacgt 480 tttggtgtca gcgcgagtga aatcatttta ccgacaagaa ctaactgaat tgtctgcctc 540 gttgagttgc ctccggaaaa gatctcgggg gtggaaaagc aactgcaaaa taacagacgg 600 agaaaattcc ttggaagtta tttctgtagc ataagagcag aaacttaaga gcaagttttc 660 attgggcaaa atgggggaac aacctatctt cagcactcga gctcatgtct tccagatcga 720 cccaaacaca aagaagaact gggtacccac cagcaagcat gcagttactg tgtcttattt 780 ctatgacagc acaaggaatg tgtataggat aatcagtcta gacggctcaa aggcaataat 840 aaatagcacc atcactccaa acatgacatt tactaaaaca tctcaaaagt ttggccaatg 900 ggctgatagc cgggcaaaca ctgtttatgg actgggattc tcctctgagc atcatctctc 960 aaaatttgca gaaaagtttc aggaatttaa agaagctgct cggctggcaa aggagaagtc 1020 gcaggagaag atggaactga ccagtacccc ttcacaggaa tcagcaggag gagatcttca 1080 gtctccttta acaccagaaa gtatcaatgg gacagatgat gagagaacac ccgatgtgac 1140 acagaactca gagccaaggg ctgagccagc tcagaatgca ttgccatttt cacataggta 1200 cacattcaat tcagcaatca tgattaaatg agatggataa atatgaagtt catttggttt 1260 cagaaactct tgagtgaaaa atcccaggtc agacttcttt aattaattaa ttgtttgctg 1320 ttgctcagat tgactgaata tttccattat ctgtgtagaa aaaggaacgt taattatagg 1380 agaaactttt tcaatggaca aaacattcca ttctatctat attttaaaga tcccttttgc 1440 taaccagttt tctgattttc tacatgttac gtaagactaa taacttgtga ttaggatcaa 1500 tggactcctg ctccaaagga aagccttgcc acaggcccac agaggtgcca cagaggacgg 1560 ggccaggcag gaacccgtca gcattgaagg ttgtttttgt atgccaacag gaggaaagct 1620 tgagttgctg ctgattctta aaagaattct gtattctaaa agatacacat catgttctaa 1680 atgcatttta aactagtgac attagttatt gggcatactg tggtattact agactacaaa 1740 gagggatatg aagtggcacc attganagta tttttttaaa agcctgtcta ccttaacact 1800 aatttttacc cttatttaaa tgctttttac taaacagttt taggtaaaat taagaaaaca 1860 gttttgttga ctgcacatct tttagaagga ccaactttta gagaattaca ttctttgaca 1920 gattaaaaat tgcaaagtga gatatttcaa actcttaagt gagttttatt gccgttggac 1980 tgcattaata cggacatacg attaaactta gtagaccaac actgagggat ctccttacca 2040 ggctgcagaa caaggaaatt aagcaataaa tgggacttgt gaatggaagg acactctact 2100 gctagtgcta gtaattctgc ataagatggt atacattttg aagaaagctg cttttaatta 2160 cttttaataa tgattttaat tactctagtg caagtgcttc ctcgagctat aaaggtagct 2220 gagcacagca gacctttatt ccctcagtct gactcctgta ctcatattca tttagtgaac 2280 atagtctttt aacagaagac cacagttctt tgatagcatt acaanactta cgttatttaa 2340 acgttataaa gaacgttatt gtaggataaa atgttaaaaa ctgtatcaag gacaggaaga 2400 attcctatct attaagtagt ggtttccacc tccacttaag actgaactgc actgaacagg 2460 taactgtata cttggtctga cacctagatt gaggccatcc gcactgaata ctgtgacatt 2520 taggagtaag aacttttaaa tttaacattt aaagaagcta cttccagttt atgcacctaa 2580 atttgtctaa atgttttcca ttttgctgac cccattgtat tcatactgct ccccagagcc 2640 tagagttgtc ctcatcctga cttcctgtcc ctgagtgtct gagaggagtc actttcactg 2700 taaagacact gcttctgcgc cttgtaggga ggacttgaca gtgctcccat agaaatccta 2760 cattatttca acctcatagt tacagtaaag gcaggttata accagtcttt cttattattt 2820 taagaatttc cagccctagt gttttatgaa agtattcctg tgaatttgat atcttatgat 2880 cctatattca tctaattcct taactgaaat aaaaatgtcc atgtgaggta ggttatttac 2940 agcgattgca ggagacatgg tgttcttcag agttcccaaa ccaggatagt ttcaaatagg 3000 tttttcatgg cttctgacga agaagaccgt aaagttccct gcagtgtgtc agtgatgtgc 3060 aagctgaatt agtgcaaagt gtcacactgt gaaagcacgt gcttttggct tattatgaga 3120 aaacgaaatc tttaaattca gtttatgtgt cttaggtcca gtttactttg atttgacaac 3180 tcagttcttc tgaccccacc ctagtatgta tgtatatgtg tgtgtatgtg tgtgtatgtc 3240 tgtatgtata tacatacata tacacacaca ttgtatacat atgctatata tacagtatgt 3300 gtatatatat actatatatg aatatatgaa tatatatatt caattagtta atagtacatt 3360 taagccaaat atccaacata agcacactat gtaagtatct atctggaaag acctatatag 3420 aattgagatc aacatttcat gagttagaaa caaaggattt tataattaat attacttaag 3480 tctaaagtac ccatatattt aaattagata tgcaattttt ccctcttggc aaagaaagac 3540 aaaaatcttg tgtttagaga tgatgtagat tgtcattttt gcctttcctt cctgagtact 3600 tgttttaaca acaacaaaaa aagactagtt taagaaaagg gattgtccag tatttttctg 3660 ctttgttaag tctaatttta ctgttaaaca gagagcagaa tcactggagt actggggggg 3720 ttttttgttg tttttttttt tttttctttt ctgttttttt cggagctggg gagcgaaccc 3780 agggccttgc gctcactagg caagcgctct accgctgagc taaatcccca acccctggag 3840 tatctgtttt aaaagaaagc caggaccgtt atgatggcca tacccngggt acatagtgaa 3900 aacaacagag accaagcaat gagagtgtga gagtaccaat ccaccagtac tgctgccgga 3960 catggcagct gcctgtgctt ttctgaagag tcatagtgta tgctaagtct agaaccatta 4020 cttagtaaag aggctatgac ttttatttgg gcctgaaaat tttagtggtg tggtcatagt 4080 ctattctgta tttgtaagct ttatttttaa attactgtgt tgatttagga acacaagaaa 4140 tgtttttatt tttaattatg agtgtatata aggttttcag atatgcacag actacaataa 4200 tagactccca tggagatacc acttcagcct taacagtcag ggagaaggag cctcacttta 4260 tcaccgcact caccctgctc tccactgatc tgttgttact gcggtgtgga ggttcacacg 4320 catgcaggtc ttcacacatg atgggtaggc ccgcaccaag tgagcctctc ccagccttgc 4380 cgtttcgttt ttttatttta atcttacatg tatgggtgtt ttgcatccag gcatgtcatg 4440 cctgtgtcca cagaagccag agagggtatc agattcccta aaactggagt tctcgatgat 4500 cgtgagcgng ccattgtggg tgctgggaac tgaagctggg tcctctacaa gagcagccag 4560 cgctcttaac cattgagcca ctatctgccc tgtgttttat ttatttattt atttatttat 4620 ttatttattt atttatttat ttattggttc tttttttttg gactggggac cgaagccagg 4680 gccttgcact tcctaggcaa gcgctctacc actgagctaa atccccaacc ccttgtttta 4740 tttttaaagc aaacgagata cataatttca accatgataa tttaagatta tcttgaactc 4800 ttaaggaaat gtatatacta agctattata gtttttattt tccctaattc agtggcataa 4860 taccttacct tgagtcgttt actactttct ttggtttcta aaaactctac tgctaaatta 4920 caatgtaaaa acatagggct cgtatatact gtagagtgct gtagatgtcc tcgtcatcaa 4980 ctatgcaata acagtctgat cgacacattt caggagcgat cactctttgg tgtgcttctt 5040 taaatacttt cagaagctta ggatgtgcaa agcaggaaga ccgtgggtgt aaatgtttac 5100 ttatttcttt gagagtgtta gtaagtcttt tctaaattgc ttttctcttc aaaattatcg 5160 ttaacttaaa tgataattat ctttgaggtt aaacagaagc tcattgacaa actaaagtga 5220 ccttttaggg cattctttga gatcatagtc ttatatctgg ggactaaaat gtcattagac 5280 cctaatagac taacttgtat gtttgtgtgg ggaaacgttt tcctctctca ttcaaggtaa 5340 ctgtttgctg cctgttgtta cttgtgtagc attctagaaa atggctaggt ttttttataa 5400 gatttaagac aatagaagta gttttatatt attatagttc tgttggaatg tgatcctgaa 5460 attattactg aaaattagaa tttttatttc gctaatgaca accttgactc tcagagatgc 5520 agtgtaaatt gatacctcat ctttccgaga gttcagagca cagggcggca gtatgtgaag 5580 ctgcttttgc actgacgcat tttgataagt ttggctactg taatggtaaa aggctcctca 5640 ggcactgact gcatttgggt tcttccgatg ggggatgatc cgttctcgtg gtgctgctgg 5700 acttatgcat tttggaggta ctgcatgtat cttccacact gcttgacatt ttctctgatc 5760 tgtgtgtttg caccaactca ttaaaagaaa tatgcagaaa tatcttctaa ttcgttgatc 5820 ttcgctgtat gacagttata atattaaaca cttgggttga tccac 5865 46 2295 DNA Rattus norvegicus 46 aatccgggct gagagaagga cgcgcacgga gtggtgcggc cgccacccgg ggcagacagc 60 agcgggtgct ctactgtgcc cggaatcccg gtcagaacat ccagtctacc actgttctct 120 gatgccatgc cagcaccaac tcaactgttt tttcctctcg tccgtaactg tgaactgagc 180 agaatctatg gcactgcatg ttactgccac cacaaacatc tctgctgttc accaccgtac 240 attcctcaga atcgtctgag atacacaccc catccagcat atgctacctt ttgtaggcca 300 agggagaact ggtggcagta tactcaagga aggagatatg cttctacgcc tcagaaattt 360 tacctcacac ctccacaagt caacagcatc cttaaagcta atgaatacag cttcaaagta 420 ccagaatttg atggcaaaaa tgtcagttcc attcttggat ttgacagcaa tcgtctgcct 480 gcaaatgcac ccatagaaga ccggagaagt gcaacaacct gcttgcagac cagagggatg 540 cttttgggcg tttttgatgg tcatgcaggc tgtgcttgct cccaggcagt cagtgaaaga 600 ctcttctatt atattgctgt ttccttgtta ccccatgaga ctttgctaga gattgaaaat 660 gcagtggaga gtggtcgggc actgctacct atccttcagt ggcacaagca ccccaatgat 720 tacttcagta aggaggcatc caaattgtat ttcaacggct tgaggactta ctggcaagag 780 cttatagacc tcaatactgg agaatcagct gatattgatg ttaaggaggc tttaattaat 840 gctttcaaga gacttgataa tgacatttca ttggaggctc aagtcggtga tcctaattct 900 tttctcaatt acctggtgct tcgggtagca ttttctgggg ctactgcttg tgtggcccat 960 gtagatggtg ttgacctcca tgtggctaac actggcgata gtagagccat gctaggtgtg 1020 caagaagaag atggctcctg gtcagcagtc acgctctcta atgaccacaa tgctcagaat 1080 gaaagagaac tacaacgtct aaaactggaa cacccaaaaa atgaggccaa gagcgtggta 1140 aagcaggatc ggctgcttgg cttgctgatg ccctttaggg cttttgggga tgtaaagttc 1200 aaatggagca ttgaccttca aaagagagtg atagagtctg gcccagacca gttgaatgac 1260 aatgaataca ccaagtttat ccctcctaac tatcatacac ctccttatct cactgctgag 1320 ccagaggtaa cttatcaccg attaaggcca caggataaat tcctagtgtt agcaactgat 1380 ggattgtggg agactatgca tagacaggat gtggttagga ttgtgggtga gtacttaact 1440 ggtatgcatc accaacagcc aatagcggtt ggggggtaca aggtgactct gggacagatg 1500 catggccttt taacagaaag gagagcaaag atgtcatcag tctttgagga tcagaatgca 1560 gcaacccatc tcattcgcca tgctgtaggc aataatgaat ttggagctgt tgatcatgaa 1620 cgactctcta aaatgcttag ccttcctgaa gagcttgctc ggatgtatag agatgacatt 1680 acaatcattg tagttcagtt caattctcat gttgtaggag catatcaaaa ccaggaacag 1740 taagtgatac tatcctggca gttcttcatt ttgaagggca gatcaatgtt caaaagaaac 1800 taatacagta aatatcccag tgggtcattc taaacacatc atgtttagta ctctagctag 1860 cccagtcttc atatctactg catcagatga tagcatcatg agtgtctgtt ctgtcctgtt 1920 ggacctcagg gtacctgcac ttgaggcagc ttgtttcctt acccaggtgt cttttaacaa 1980 tgactcacca actaagaata tggataggta gatacgatct tgaataggtc aaaagcaagg 2040 aacttctggg agtattagta aaggtagaaa acatcaccat gcccacctgc agactccttc 2100 catcataaga ctcctaatgt acatgagaat agttatttac tgcatttttc agatgaacag 2160 ttcaggtatt cacatacatt tatgttagcc taaagtgcag gtccagtatt ttcagccatt 2220 tatgaccatg aacaaagaaa tctggatttg taagtttttg tgattgtgtg ctaaagtatg 2280 cttcctgaac ggaaa 2295 47 386 DNA Rattus sp. 47 gaggtataaa aatctgtatt tatattacaa tgacataatg acacagcaca gccccatctg 60 ttagacaggt gggatggggt cccattccag tctgcagctc aggtctcata actcacccac 120 tgcactggca ccaggcccct tgtgtagccc cagactctgg cacggaacct gccctagaac 180 agcaagctca accacctgca ggttagaaat acagaggcag acctagggcc catatggaga 240 caaaacctcc gtgtcctctc ctgctagaca agctgggtct gcagcaggat ggtgggtctg 300 gtacatctgg accacctcct acgaaagagg cccaagcctg agcctggagt tgcagactct 360 gttatggagc acacgtggct cacatg 386 48 387 DNA Rattus sp. 48 catatatagc tgtgttcaga cacatcagaa gagagcatca gatctcatta catatggttg 60 tgagccacca tgtggttgct gggatttgaa ccaggacctc tgtaagagaa gtcagtgctc 120 ttaacactga gccatctctc cagcatttgt tttgttttaa aacaatgacc ataagaggaa 180 tggtggactt ctagcttgtt ctgcctacag taacaaaagc acatcacacc cccagctcac 240 aaacctgtgt gcaatgtcac taccaccaca caacggtact gcgttacact gttggaaacc 300 gcatttccac tcagtgcaga taaaagcaat aggaaacacc cagcccagag tcagattatt 360 ttctgttgtg aagtcaggcc tcgtgcc 387 49 425 DNA Rattus norvegicus 49 tttttttttt tttttttaca catgcgcggg ctttattata aagcacgttt cttgacctgg 60 ggcctccagc taggatgata ggacagcacc cagtctcggt ttaccaggac tcgcaactta 120 tcacaggcgc ctgagggctg tccagtaacg gagagggaac atagccactt ccggaggcgt 180 atggggtaag gtgtgtggaa aactttgaaa actccaaagc aagggccacc cctccatggc 240 tattgcacgc accggaagga actcaacgct gtacgtcagc tcttctccct gttaaaggga 300 cacagcgatc cttttttcct aaggcctcga aagacttccg cccacaccca atatggcggg 360 cgatgggagc gggtagaatt ccgggaatgt ggagcagatc actgggacgg agtggagcct 420 cgtgc 425 50 647 DNA Rattus sp. 50 ggattctttt cttttattat tggcatcagc tggactatat gtggccttaa acatcatgca 60 ctggacagga aagaaagaac gagaaaaaag gacacaggaa gggaacacga gtagcggcga 120 gattccgtaa taaaaactgt aattcattca ataggttaag ttttggcatt atgaaatcaa 180 acaccccttc ccacccccaa aagttgaaac ttaagtgaga gggtcgcccg tgggacccag 240 ccaggtgttc tgcgttccct cacaggctgt gacaggcagc gtgtgggtgt aattcttgtg 300 aacggtctca acttttcact ttctttaaat aatctctttt gcttcttaga cgttccggtt 360 cactggggtg acataattgc ggggaaacat gccggtctgc ccatggcaag cccctttcca 420 ccaattggga tctgagttat ccatgacatg aatgaagtct cctcggcgaa agcccagctc 480 accatcctcc tgggggtcaa agtcaaagag tgcctggacg tacgttggct gtggcacctg 540 ttctatgtcc cggaggaata tctgctggtt cctggacacg gatgttgatc tgtggtaatc 600 taccagctca ttcaaagaat taaacttcac cacccacagg aagtact 647 51 613 DNA Rattus sp. 51 aagaagtcat tggagtttat tcacagttaa tcactaccta ccaaattgct attcgcagaa 60 gttagaggcg taagtacata ggtttgtttt tttatttaaa cactgatctt taaatatata 120 cacacaaaac ttagttcagc aaggcttcat gatatacacc aattccaaaa taaaacaatc 180 aaatggccca ggtgtaactc cagagataaa tttttatcat cagcagggaa agaggcagca 240 gaaccaggag gggtgggaac aggctccgcc caggacgcct ccggggcctc aggggtgctc 300 cgagctgagt ccatctcaca aaacagagtc caggggggac cccaccctca agagtccagc 360 agcccacgga cgccgccaac tccaagggag agcctgggag tagccatgac tctgctgtgg 420 ggaggaggct gcgtggagag aaacggtgga caggcacatg actacggagg actgagagct 480 cacaggagac aagatttaaa aagctttgcg ttgctcttgg tcttttctac ctcaaaaggc 540 ttcatgggat ggggcagcag agcagctgca aacaccactg ttgtgcacag agatgcaacc 600 atgcaaccag ccc 613 52 458 DNA Rattus norvegicus 52 ttgaatgcat tttttaaatt ttattgtttt tcagtatgac agtgaaaatt ttacatagcc 60 catgttgaac ttagatattc aaataagact ttaccaataa taaaggagat taatacattg 120 ctagagttct acatttgact ttatgtcaaa agagtcaatt tagctctatg aattacagaa 180 gactttgtat tctgttcctt ttagctttat taaatgttga catctatgat tacatcagac 240 ttcgatacct cagttcacat ctacagaaaa ttttaaaagt gattctgaat actacagata 300 aaatattttt attgtaggta attattcaat ctgtatattt ggcttgatgc atgataagtc 360 atacgggaaa ggcatccgcg ctatacttga atttccagaa tctagcattc tttctttttc 420 aaaatatttt tttcttttta gcaataatat tgacagat 458 53 6820 DNA Rattus norvegicus 53 ccgagaacgg ctgcagtcct ctgacctgag accaatagct gtgtctaccc ggactcagcg 60 tccagctcac cgccactaac gcgccgcgca ttggacacct gatccacaca ccttcgggca 120 ccagtgaaaa accgcgactt gattttctgg aagaacgccc ccagggtgtg ggagcggtcg 180 tggaggacca gcaggaggaa gcggagggga gaggggcagt agtggaggca gagaaagcgt 240 tgaaccagct gtgttggccg aaggcacgaa acggcaaaag gcagcggtga gcatctgtgt 300 ggttcccgct gggaacctgc aggcaggacc ggcgtgggaa cgtggctggc ccgcggtgga 360 ccgcgtcttc gccacaatgg tccggctcct cttgattttc ttcccaatga tctttttgga 420 gatgtccatt ttgcccagga tgcctgacag aaaagtattg ctggcaggtg cctcgtccca 480 gcgctccgtg gcgagaatgg acggagatgt catcatcgga gccctcttct cagtccatca 540 ccagcctcca gccgagaagg tacccgaaag gaagtgtggg gagatcaggg aacagtatgg 600 tatccagagg gtggaggcca tgttccacac gttggataag attaacgcgg acccggtgct 660 cctgcccaac atcactctgg gcagtgagat ccgggactcc tgctggcact cttcagtggc 720 tctcgaacag agcatcgaat tcatcagaga ctccctgatt tccatccgag atgagaagga 780 tgggctgaac cgatgcctgc ctgatggcca gaccctgccc cctggcagga ctaagaagcc 840 tattgctgga gtgatcggcc ctggctccag ctctgtggcc attcaagtcc agaatcttct 900 ccagctgttc gacatcccac agatcgccta ttctgccaca agcatagacc tgagtgacaa 960 aactttgtac aaatacttcc tgagggtggt cccttctgac actttgcagg caagggcgat 1020 gctcgacata gtcaagcgtt acaactggac ctatgtctca gcagtccaca cagaagggaa 1080 ttacggcgag agtggaatgg atgctttcaa agaactggct gcccaggaag gcctctgcat 1140 cgcacactcg gacaaaatct acagcaatgc tggcgagaag agctttgacc ggctcctgcg 1200 taaactccgg gagcggcttc ccaaggccag ggttgtggtc tgcttctgcg agggcatgac 1260 agtgcggggc ttactgagtg ccatgcgccg cctgggcgtc gtgggcgagt tctcactcat 1320 tggaagtgat ggatgggcag acagagatga agtcatcgaa ggctatgagg tggaagccaa 1380 cggagggatc acaataaagc ttcagtctcc agaggtcagg tcatttgatg actacttcct 1440 gaagctgagg ctggacacca acacaaggaa tccttggttc cctgagttct ggcaacatcg 1500 cttccagtgt cgcctacctg gacacctctt ggaaaacccc aactttaaga aagtgtgcac 1560 aggaaatgaa agcttggaag aaaactatgt ccaggacagc aaaatgggat ttgtcatcaa 1620 tgccatctat gccatggcac atgggctgca gaacatgcac catgctctgt gtcccggcca 1680 tgtgggcctg tgtgatgcta tgaaacccat tgatggcagg aagctcctgg atttcctcat 1740 caaatcctct tttgtcggag tgtctggaga ggaggtgtgg ttcgatgaga agggggatgc 1800 tcccggaagg tatgacatta tgaatctgca gtacacagaa gctaatcgct atgactatgt 1860 ccacgtgggg acctggcatg aaggagtgct gaatattgat gattacaaaa tccagatgaa 1920 caaaagcgga atggtacgat ctgtgtgcag tgagccttgc ttaaagggtc agattaaggt 1980 catacggaaa ggagaagtga gctgctgctg gatctgcacg gcctgcaaag agaatgagtt 2040 tgtgcaggac gagttcacct gcagagcctg tgacctgggg tggtggccca acgcagagct 2100 cacaggctgt gagcccattc ctgtccgtta tcttgagtgg agtgacatag aatctatcat 2160 agccatcgcc ttttcttgcc tgggcatcct cgtgacgctg tttgtcaccc tcatcttcgt 2220 tctgtaccgg gacacacccg tggtcaaatc ctccagtagg gagctctgct atatcattct 2280 ggctggtatt ttcctcggct atgtgtgccc tttcaccctc atcgccaaac ctactaccac 2340 atcctgctac ctccagcgcc tcctagttgg cctctcttct gccatgtgct actctgcttt 2400 agtgaccaaa accaatcgta ttgcacgcat cctggctggc agcaagaaga agatctgcac 2460 ccggaagccc agattcatga gcgcttgggc ccaagtgatc atagcctcca ttctgattag 2520 tgtacagcta acactagtgg tgaccttgat catcatggag cctcccatgc ccattttgtc 2580 ctacccgagt atcaaggaag tctaccttat ctgcaatacc agcaacctgg gtgtagtggc 2640 ccctgtgggt tacaatggac tcctcatcat gagctgtacc tactatgcct tcaagacccg 2700 caacgtgccg gccaacttca atgaggctaa atacatcgcc ttcaccatgt acactacctg 2760 catcatctgg ctggctttcg ttcccattta ctttgggagc aactacaaga tcatcactac 2820 ctgcttcgcg gtgagcctca gtgtgacggt ggccctgggg tgcatgttta ctccgaagat 2880 gtacatcatc attgccaaac ctgagaggaa cgtccgcagt gccttcacga cctctgatgt 2940 tgtccgcatg cacgtcggtg atggcaaact gccgtgccgc tccaacacct tcctcaacat 3000 tttccggaga aagaagcccg gggcagggaa tgccaattct aacggcaagt ctgtgtcatg 3060 gtctgaacca ggtggaagac aggcgcccaa gggacagcac gtgtggcagc gcctctctgt 3120 gcacgtgaag accaacgaga cggcctgtaa ccaaacagcc gtaatcaaac ccctcactaa 3180 aagttaccaa ggctctggca agagcctgac cttttcagat gccagcacca agacccttta 3240 caatgtggaa gaagaggaca ataccccttc tgctcacttc agccctccca gcagcccttc 3300 tatggtggtg caccgacgcg ggccacccgt ggccaccaca ccacctctgc caccccatct 3360 gaccgcagaa gagacccccc tgttcctggc tgattccgtc atccccaagg gcttgcctcc 3420 tcctctcccg cagcagcagc cacagcagcc gccccctcag cagcccccgc agcagcccaa 3480 gtccctgatg gaccagctgc aaggcgtagt caccaacttc ggttcgggga ttccagattt 3540 ccatgcggtg ctggcaggcc cggggacacc aggaaacagc ctgcgctctc tgtacccgcc 3600 cccgcctccg ccgcaacacc tgcagatgct gcccctgcac ctgagcacct tccaggagga 3660 gtccatctcc cctcctgggg aggacatcga tgatgacagt gagagattca agctcctgca 3720 ggagttcgtg tacgagcgcg aagggaacac cgaagaagat gaattggaag aggaggagga 3780 cctgcccaca gccagcaagc tgacccctga ggattctcct gccctgacgc ctccttctcc 3840 tttccgagat tccgtggcct ctggcagctc agtgcccagt tcccccgtat ctgagtcggt 3900 cctctgcacc cctccaaatg taacctacgc ctctgtcatt ctgagggact acaagcaaag 3960 ctcttccacc ctgtagtgtg tgtgtgtgtg tgggggcggg gggagtgcgc atggagaagc 4020 cagagatgcc aaggagtgtc aacccttcca gaaatgtgta gaaagcaggg tgagggatgg 4080 ggatggagga ccacggtctg cagggaagaa aaaaaaaatg ctgcggctgc cttaaagaag 4140 gagagggacg atgccaactg aacagtggtc ctggccagga ttgtgactct tgaattattc 4200 aaaaaccttc tctagaaaga aagggaatta tgacaaagca caattccata tggtatgtaa 4260 cttttatcga aaaaaataat aaaacgtaaa aataaaatca acaaaaataa tctcttcttt 4320 tgctcaatcg tgcatacata tatctgccca cactcccgtg gtaaaactag aagcgaagca 4380 ggccctgcga tggtgccaac tgaatcctaa gttcatcatc ctagtgagca gatggagaga 4440 gggcaggagg cgggggtagg ttcggacaac agctcccatc tcagaccttg actgtgctga 4500 gtcttcagac tcctggacta aggaagaccc ggggactgac cttatgaggg tccctttcca 4560 ctgctgtgat ccattgccag cctgtagtca cccgggataa aggcacagta accttttgca 4620 ttcctgtgat tccctgtgtt taaggaaaag gaaagtatga gcaaagctat caccaaaaag 4680 agcgccatta gaagttacgg gggagaaaaa aagagaagca agatgatata taagcacagg 4740 gccttgaaca aggtgagcgt gcttcacaga ttccgtatta atgtacagat acttttggag 4800 aggagaaaga taacaaggag tgtcaggccg tttgtgaact cacttgcact gtgccaacca 4860 ggttctccgc tgcccttcag caaaagagga caagccgcgt tgccaggttt taccttccat 4920 ttactgtagc aaatactatc aaccagtcgg acttctaaga ttcagtttca gtttcagtac 4980 aatgcggtgc cactgtttct cccatgtgct atggaaacga atctatcttt gaacttaatg 5040 atgtattcat agcaactatt actggtttag attttttcct tttgtcacag gagtccctgg 5100 aactagtaac tgaaagtgtt ttcctgcgtt tcttgtatac atgtgattat gaaattcgtg 5160 ccatttaatg tcaatttagc tgtcactaga agactgtctt ttggatatag tataaatatt 5220 tttatgtacc agtgatgttc tccataccac ggttaccatg tttctctgga ggttgggtct 5280 gtggtctgat gtttctcatg tgcagcttcg atgggaattc ttctaagtgg gatttatttt 5340 tcagatattt tatgatatga gaatgttatt aatgaagtaa tttgaaagtg cattgtataa 5400 aaatggtcac caagcaatgc gtgacagtaa aaggtccgtt tttataaacc tgcgcacatt 5460 gttattaaaa tgtaaggttg aaaaggcaat atttagaata tttcagatat atttttaaaa 5520 agtttttcca cagctacttg agtttcatgg tcttctagta tataacaaca ctcaagtcta 5580 cccagagtgt ctcaactatc tgcttgtcaa ttctgcttaa ttttattttc atgcatttaa 5640 acttttatat ctttgttagc atctcttcct tatgatcctc atgtgtacta ttatgtaata 5700 accacataca tgtaatatcc acatacatgt aatatccaca catgtaacat tcacatacat 5760 gtagtccagt tattccatct tgaccctacc ttttcgaacc caaaagaaaa ttgttcttgt 5820 tatttttatt tcttctgtta tttgtgagat gaacccgttc cctttaaata atctttgttt 5880 gtgccttatg ttcagtcatt ttaatttgct gtcttcatgt cgaagctgct ggtttctcag 5940 ccaaaaagca tcatcttaga ctctctaaat agccaaagca tcatgagttt ggaatttaac 6000 atcagctccc atgtcagagt tgtgctcctc atgtgatccc acattctact gcccagtgta 6060 gtgaattcct ttccaagaac tcttgccttt gctttccaag ttatttttga gcatcttggt 6120 tgcagagatc tcaagaattt acgtcttgga ttccacgttt tcactacgaa gaaacagaat 6180 gagaagaaga agaaaaatta ggcagtgtag agctgggcgt agtggtccag gtctttaagc 6240 ccaggctagc ctgatttagc caataaattc taggcctaaa aagagagacc tgtctcaaaa 6300 ctcaaagcac acaacagatg ctaagtagat gggtctccat aattgggaag ccaatgagag 6360 aatgcatatt tcttcctatg ttctttaaaa cttgaagcag ttacatccgt ctttcatcat 6420 tacgggactc gtgcattcag agccttttgt tgttcttttg ccagaataga tgaggcaaca 6480 tttgcctatt cgaatgctgt aacaggcaag ttgactctag ggttttggtc tgagacattt 6540 ggtgaacacc ttcaacactg attaaaatat tactgaatgc ctactcttat cctgattatg 6600 aatcttccag aataaataga atattagctc atataattgt tcagaattgg agatgtatgc 6660 ctactaccct gtacctaaag ggcaaaaata tcttcactgt aatgtgtgtg cttcttcaag 6720 gtgttttgct tcttgtaaaa gtgttttcct ttggcttgtt actgcctttt gtcagataat 6780 cttgatgacg ctgtatcata ataaatattt tctatttatt 6820 54 759 DNA Rattus norvegicus 54 cctcaggctc agacacctgc tctactccaa gcaaatggct gctcttccaa tgctgtggac 60 cgggctggtc ctcttgggtc tcttgggatt tccacagacc ccagcccagg gccatgacac 120 agtgcagccc aactttcaac aagacaagtt cctggggcgc tggtacagcg cgggcctcgc 180 ctccaattca agctggttcc gggagaagaa agagctactg tttatgtgcc agacagtggt 240 agctccctcc acagaaggcg gcctcaacct cacctctacc ttcctaagga aaaaccagtg 300 tgagaccaag gtgatggtac tgcagccggc aggggttccc ggacagtaca cctacaacag 360 cccccactgg ggcagcttcc actccctctc agtggtagaa accgactacg atgagtacgc 420 gttcctgttc agcaagggca ccaagggccc aggccaggac ttccgcatgg ccaccctcta 480 cagcagagcc cagcttctga aggaggaact gaaggagaaa ttcatcacct ttagcaagga 540 ccagggcctc acagaggagg acattgtttt cctgccccaa ccggataagt gcattcaaga 600 gtaaacacag gtgagagaag tcagtcacag gtaacacatg gtgatgtggc ctcaggactc 660 ccgtgctctg tcactcttga gacccaagcc ctggctcccc aaagaccttc tccgccctcc 720 agctttgcct tggtggagaa ataaaatcca aagcaagtc 759 55 2591 DNA Rattus norvegicus 55 gaggaggaag gcgctgctgg cgagcctcag ccggccgctc agggggctaa ggaggtctgg 60 agggctcggc gcgactgagg gccggaccac tagaggtatc gggtggcgct gcggcagcag 120 caggtgcggg tggcggaggc tgcggctcca gtgccatccc ttattccacc tgcgctgtca 180 gcatgcacct tgcagaacca gaagcagctt gctggctttg aacgtgtggc aaatatttca 240 gaaagcttca agatcaagtt ggaggaaagg acggtttttc ttctaaattc atctgcttca 300 actattattc ttactgggaa tggacaatgg aatgctctct agatttatca tgaccaaaac 360 gctccttgtc ttctgcattt ccatgacctt atccagtcac tttggctttt cacaaatgcc 420 aactagttct gtacaagatg agaccaatga caacatcaca atattcacca ggatcttgga 480 cgggctcttg gatggctatg acaacagact gcggcctggg ctgggagagc gaatcacgca 540 ggtgcgaaca gacatctatg ttaccagctt tggcccagtg tccgacacgg aaatggaata 600 taccatagat gtatttttcc gtcaaagctg gaaagatgaa aggctgcggt ttaaggggcc 660 tatgcaacgt ctccctctca acaaccttct tgccaggaaa atctggaccc cagacacatt 720 cttccacaat gggaagaagt ccattgcgca caacatgacg acacccaaca agctgctgag 780 gctggaggat gatggcacac ttctctacac catgcgcctg acgatctctg ctgagtgtcc 840 aatgcagctt gaggactttc cgatggatgc ccatgcttgt cccctgaaat ttggcagtta 900 tgcttaccct aattcggaag ttgtctatgt ttggaccaat ggttccacca agtctgtggt 960 ggtggcagaa gatggctcca gactcaacca gtaccacctc atggggcaga cagtaggcac 1020 tgagaacatc agcaccagca caggtgaata tacaatcatg actgctcatt ttcacctgaa 1080 gaggaagatc gggtactttg tcatccagac gtaccttccc tgcatcatga cagtcatctt 1140 atcccaggtg tctttttggc ttaatcgaga atctgtccca gctaggacag tttttggagt 1200 gaccacagtg ctgaccatga caaccctcag catcagtgcc cggaattcgc tgcccaaagt 1260 ggcctatgcc acagccatgg actggttcat tgctgtctgc tatgcatttg tcttctctgc 1320 cctgattgaa tttgccacag tcaactactt tacaaagaga ggatgggcct gggatggcaa 1380 gaaggccttg gaagcagcta aaatcaagaa aaaagaacgt gaactcatac taaataagtc 1440 aacaaatgct tttacaactg ggaagttgac ccatcctcca aacatcccaa aggagcagct 1500 tccaggcggg actgggaatg ctgtgggtac agcctcaatc agagcatctg aggagaagac 1560 ttctgagagt aaaaagacct acaacagcat cagcaagatc gacaaaatgt cccggattgt 1620 gttccccatt ttgtttggca ctttcaatct agtttactgg gcaacatatt tgaataggga 1680 gcccgtgata aaaggggcta cctctccaaa gtaagacagg aaaccatact tgcacagaaa 1740 tgaacctgag gagaggtcaa gctcacagag actatttggg cgcctgtctt tcaggaaatt 1800 ttgcatgttt aataatatgt acaaataata ttgccttgat gtttctatgt gtaacttcaa 1860 tgtttcaagg atgtccctta ataaaccaag caaatggcct tctacaacaa cgggaggcaa 1920 tgactgactc tcagatgctc agcgtcctaa catcaatagt ttacaaacaa gataagtata 1980 tttttaactg ttctggtata tgacgttttt atacttcgaa tgccatttcg taccattttt 2040 cccagccaac agaacatttt agggaatccc tgtgatgacc acttgacagg tgaaaaagca 2100 aagatcctcg ggtacacaaa gtccatgaag agcaaactgt ggacatttaa gtccagtacg 2160 aattgccttt aacaattctt cttgttctga aattagaaaa atactgcatg aactgacatt 2220 aagaagtaga taagcaaaca tttatgcaga caaatttaat gacaagccca tagtgtctta 2280 gattagtaga tcaaataatt ccccaaggaa aagaaatcaa ctgattcaaa attaattttg 2340 ttgttttttg tgaaaaatga atttatttct caccccgccc caacaccttt acaaccttaa 2400 taatgactaa gaaagcaaaa tcttaaaacc ttaaagcaac aaggccttgg tctttggtgg 2460 tggtattctg tggccattgt ttctgaccct gggtcctctt gctgctgctt cagcgctgag 2520 aaattgtaat tgagttattt tctgttttat ttccctgtac atatttcatg gttggattat 2580 cgctctgtta g 2591 56 2977 DNA Rattus norvegicus 56 gaattcggcg gatggaagcc agctgtcccg agaagcagtg aactgtggcg tcatcccgag 60 cagtgcctta ccggtattgt gctgcttcac ctgcctcgct cggcgttctc ctcaggcccc 120 gccatggagc gacaggtcca acgacttcgc cagacgttcc ggtccggccg atcgcggccg 180 ctgcgtttcc gactgcagca gctcgaggcc ctccggagga tggtgcaaga gcgagagaag 240 gacatcttgg cagccatcgc agcagacctg agcaaaagtg aactcaatgc atacagtcat 300 gaagtcatta ccatccttgg ggagattgac ttcatgctgg ggaatcttcc tgaattggcc 360 tctgctcggc cagcgaagaa gaacctgctt accatgatgg acgaggccta tgttcagcca 420 gagcctctgg gagtcgtgct gattattgga gcttggaact atccttttgt tctgaccctg 480 cagccactgg tgggagccat tgctgcagga aatgctgcca ttgttaagcc ctcggaactc 540 agtgaaaaca cggctaagat cttggctgaa ctcctccctc agtatttaga ccaggacctg 600 tacatgattg ttaatggcgg cgttgaagaa accacagagc ttctgaggca gcggtttgat 660 cacattctct acacaggaaa caccgcagtt ggaaaaattg tcatggaggc tgctgccaag 720 cacctgaccc ctgtgaccct ggagctcggg ggcaaaagcc catgctacat tgacagagac 780 tgtgacctgg acgttgcttg cagacggata acctggggaa agtacatgaa ttgtggtcag 840 acctgtattg ctcctgacta tatcctgtgt gaagcctcct cccaggatca aatcgtacag 900 aagattaagg atacggtgaa ggacttttat ggggaaaatg taaaagcttc tcctgattat 960 gaaaggatca tcaaccttcg tcactttaag aggataaaaa gtttgcttga aggacagaaa 1020 atagcttttg gtggggagac tgatgaagct acacgctaca tagccccaac catactcact 1080 gatgttgacc ctaactccaa ggtgatgcaa gaagaaattt ttggaccaat tctcccaata 1140 gtgtctgtga aaaatgtgga ggaagccata aatttcataa atgatcgcga aaagcccctg 1200 gcactctaca tattttctca caacaataag ctcatcaaac gggtgattga tgagacatcc 1260 agtggtggag tcacaggcaa tgatgtcatc atgcacttca ctgttaattc tttgcccttt 1320 ggaggtgtgg gtgccagtgg aatgggggct tatcatggca aatacagttt cgataccttt 1380 tctcatcagc gcccctgctt gttaaaaggg ttaaagggag agagtgttaa caaactcagg 1440 taccctccca acagcgagtc caaggtcagc tggtcgaaat tcttcctgct gaaacagttc 1500 aacaaaggaa ggctgcagct gctgcttctc gtgtgcttgg ttgcggttgc agctgtgatc 1560 gtcaaggatc agctgtgatg acttccttgt agcctctact gaagtacccc tcggccaaat 1620 ggttaacaca ccaatgcttt taaaattgta cccaaaccag gaaatgaaat tcacaggtga 1680 actgcagtca aacctaagtt gttgccacaa accactgatg aaactcagtg cttcagccaa 1740 atcccagcat ttgtcagccg tgcaggtgct gagagggtgg agactgggag gggcgacacc 1800 tagtccatgg cagcgggatg tcagggagac tcgacaactg ctcccgcact ctttgctcca 1860 ggacatagct ctcccacccg gtgtcaacac cctccaggct ttccagctgt cctctgattg 1920 ctgaggttcc tgttagggac ccaggtacta aacctgggcg ggtggatttg tcggcctcat 1980 ccattgtggc tcgagaccgg ccttcgggag tcggctctca gtctaaacat cctttctcat 2040 tcatagtgtg tcacccgaag atgcttgttt gtgacattgt gacagtctgt catgactgtc 2100 ccggtgcctt tgtgatgact taaactacac tgaggagctt gccaacttgt gaatgccctt 2160 cagagggtct ggcagtcaca gctgttccag agcccgaggg acgaagattc cggagcccgg 2220 agtttgaggc caacctaggc aacataatgg gaccctctca ttattattcc tccataacaa 2280 tcccctcgag accctcgatt tgaatgttat ataggtcttc aggataaatc tgcttatttt 2340 cacagcacaa cacaaaaaaa atttactttt gaaatcttag agagattcct acagatctta 2400 gcatggagct gttcctgtag tgaaaggggg gttattagac atgaggcttc agaactcatg 2460 gggcagggtt gttggagact accgtgagct gagggggcac actgaagcga tgggatggcc 2520 agaagcgcac ctgagcaagc ggggcagcat tctctgtcag accctaacat ggctacacgg 2580 ggatgtggca gagagatctg tgccgttggc tgccagcgct ggttaggcct gaagctccaa 2640 gctgcagagg tctcattgcc ttcccaggat ccaaattaag actgcccact caatgagaat 2700 gtcacttgcg tatgtacaac catgtttgct gagtaacctg ttccaccgtt gaggctgtct 2760 gaagtgtatt gtatgaggta tcaagaacga gtcattggcc catttggcaa atagttgctt 2820 atgtagcaat tgtcatggac taatcataaa atattttgca caaaatttca atgttgaact 2880 tgcactcact gttgttaaat tataaatcac agcttctagt taggccaaaa tatttacata 2940 ctctactaat cttcaaaata aatgtatccc ggaattc 2977 57 584 DNA Rattus norvegicus 57 ttgactcaga agcagacgtt tatttactta aaacatttca caattatatt gacaattatt 60 tatttaaaat aatattctaa atacttacaa aaataaatca agtattgctt ctagttttca 120 agtgtgacag catttgccat agaaaataga agggaccttt tgaataaact aaacgctaca 180 ctgaaacttc ctcaagttta ataactacac actatttcct ctgatgactt gtgctttata 240 aatgtccaaa ctatttctgt gcaacaaaca ggatcacagt aagactgaat gtataacaag 300 ccatatacat tcatttttac acaaaatgtc agtactctgg cccaaaccac catcatcccc 360 aaaatgccag atacaccatt atacagaagt acatgtggtt gtgaagtttt taacatccag 420 tcatcaaaga ccagatattt attattaaag tatactcagc taaaggttgc taggcaagac 480 attttttact tacattaaat cattagtcaa atcattaaaa aaatagaata aacttcaaat 540 atgtctcaat acattaagtt aatcccaaga taaacatcag tgtc 584 58 1780 DNA Rattus norvegicus 58 tgtgctgggt gtgccccaga gctggctttg actgtacgct gtcaggctct ccctggacct 60 cacggaacag cattgccagc cacacggctt ccaacaaatc acctcttttc atgctgtttg 120 gcacagatcg aatctacagg ttatacaatg gtcgatggag caatgatcct ttctgtgcta 180 atgatgatgg ctctcccttc cccgagtatg gaagatgagg agcccaaggt caacccgaag 240 ctttacatgt gtgtgtgtga gggcctctcc tgcgggaacg aggaccactg tgagggccag 300 cagtgttttt cctccctgag cgtcaatgat ggcttccgcg tctaccagaa gggctgcttt 360 caggtctatg agcaggggaa gatgacgtgt aagaccccgc cgtcgcctgg ccaggctgtg 420 gagtgctgcc aaggggactg gtgcaacagg aacgtcacgg cccggctgcc cactaaaggg 480 aaatccttcc ctggatcgca gaacttccac ctggaagttg gccttatcat cctctccgtg 540 gtgtttgcgg tatgcctttt cgcttgcatc cttggcgttg ctctcaggaa gtttaaaagg 600 cgcaatcaag agcgcctgaa ccccagagac gtggagtacg gtactatcga agggctcatc 660 accaccaacg tcggagatag cactctagcg gaattactag atcactcatg tacatcagga 720 agtggctccg gtcttccttt tctggtacag agaactgtgg ctcgacagat aaccctgttg 780 gagtgtgtcg ggaagggccg gtatggagaa gtgtggaggg gcagctggca aggcgaaaat 840 gttgctgtga agatcttctc ctcccgtgat gagaagtcgt ggttcaggga gacagaattg 900 tacaacacgg ttatgctgag gcatgagaat atcttaggtt tcattgcttc agacatgacc 960 tctagacact ccagtaccca gctgtggctc attacacatt accacgaaat gggatcgttg 1020 tatgactacc ttcagctcac cactctggac acggttagct gccttcggat cgtgttgtcc 1080 atagccagcg gccttgcaca cttgcacata gagatatttg ggacccaggg gaagtctgcc 1140 atcgcccacc gagatctaaa gagcaaaaac atcctcgtga agaagaacgg acagtgctgc 1200 atagcagatt tgggcctggc agtcatgcat tcccagagca cgaatcagct tgatgtggga 1260 aacaaccccc gtgtggggac caagcgctac atggcccctg aagtgcttga tgaaaccatc 1320 caagtggatt gctttgattc ttataagagg gtcgatattt gggcctttgg cctcgttctg 1380 tgggaagtgg ccaggaggat ggtgagcaat ggtatagtgg aagattacaa gccaccattc 1440 tatgatgttg ttcccaatga cccaagtttt gaagatatga ggaaagttgt ctgtgtggat 1500 caacagaggc caaacatacc taacagatgg ttctcagacc cgacattaac ttctctggcg 1560 aagctgatga aagaatgctg gtaccagaac ccatccgcca gactcacagc tctacgtatc 1620 aaaaagactt tgaccaaaat tgataactcc ctagacaaat taaaaactga ctgttgacat 1680 tgtcaccggt gtcaagaagg agagtcaatg ctgtcattgt ccagctggga cctaatgctg 1740 gcctgactgg ttgtcagaac agaatccatc tgtccccctc 1780 59 3032 DNA Rattus norvegicus 59 agtgctctgg cgagtagtcc tccctcagcc gcagtctctg ggcctcttca gcttgagcgg 60 cggcgagcct gccacactcg ctaagctcct ccggcaccgc gcacttgcca ctgccactgc 120 cgcttcgcgc ccgctgcagc cgccggctct gaatccttct ggcttccgcc tcagaggagt 180 tcttagcctg tcccgaaccg taaccccggc gagcagatgg agctggacca tatgacgacc 240 ggcggcctcc acgcctaccc tgccccgcgg ggtgggccgg ccgccaaacc caatgtgatc 300 ctgcagattg gtaagtgccg agctgagatg ctggagcacg tacggaggac ccaccggcat 360 ctgttgaccg aagtgtccaa gcaggtggag cgagagctga aagggttgca caggtcggtg 420 ggcaagctgg agaacaactt ggacggctat gtgcccacgg gcgactcaca gcgctggaag 480 aagtccatca aggcctgtct ctgccgctgc caggagacca tcgccaacct ggagcgctgg 540 gtcaagcgtg agatgcacgt gtggagggag gtcttctacc gtctggagag gtgggccgac 600 cgcctggagt ccatgggcgg caagtaccca gtgggcagcg agccggcccg ccacactgtc 660 tctgtaggtg tggggggtcc agagccctac tgccaggaag ctgatggcta cgactacact 720 gttagcccct atgccatcac cccgccacct gccgcaggag agctgcctga gcaggagtca 780 gttggggctc agcaatacca gtcttgggtg ccaggtgagg atgggcaacc aagcccaggt 840 ctggataccc agatctttga ggacccacgg gagttcctga gccacctgga agagtacctg 900 cggcaggtgg gtggctctga agaatattgg ctgtcccaga tccagaacca catgaatggg 960 ccagccaaga agtggtggga gttcaaacag ggctcggtga agaactgggt ggagttcaag 1020 aaggagtttc tgcagtacag tgagggtacg ctctcccgcg aagccattca gcgggagctg 1080 gacctgccac agaagcaggg tgagccactt gaccagttcc tctggcgtaa gcgggacctg 1140 taccagacac tgtatgtgga cgctgaggag gaggagatca ttcagtatgt ggtgggcacc 1200 ctgcagccca agttcaagcg ctttctgcgc cacccacttc ccaagaccct ggagcagctc 1260 atccagaggg gcatggaagt tcaggacggc ctggagcagg cagctgagcc ttctgtcacc 1320 cctctgccca cagaggatga gactgaggca ctcacgcctg ctcttaccag cgagtcagta 1380 gccagtgaca ggacccagcc tgaatagagg ggccagccca gggtccccag cctgcctgcc 1440 acacccagtc tgtggctttt gtcaactagg acttgattga gctggggctg acacccaagg 1500 ggatgccctg tccagccaga caccttctca cccactggcc tgactcacaa ctgccacaca 1560 accatgattc atggacatca agaagcccct ctcccatagg gctcccacct gccacctacc 1620 cctcacctgt ctgccctagt cctggccctg tctccagtgg cctcaccctc tacactctca 1680 gaccatcaca gaacaccttt ggcttcctca ttctgcatca gtgtccaggg ccctttgggt 1740 agtcaagaaa tcaagtgtct gaaaggcaat gaaaagtagg caccaaaccc aaggggcatc 1800 ccagggcaga tgctaaagca gaatcagaga tggccgaagg aacctctact tccggggatg 1860 cagcccgctc ctacagacac agcagatcca gctggtgccc tacctgcctc ccagagcaac 1920 tggccagtct tgggcagcat agctcccctc tcagggtgag ctgaagcagc agacctgacg 1980 cgctggcgcc tcctggcccc cagcagtgat tcataccagt gaagaaaagc agacttcggc 2040 tccatgactc agccatgcca ggcggagggt cccagagggg ctgagtcctc agccccagct 2100 gaggcagcag ctggagtctt cagagccagg tgaatgacac caggtctcaa gctgctgaga 2160 agtctttccg gccatgtctg gaaggggtac caccccagca ccagcaccgt cccctcctct 2220 cttgaagctg cctgcacaga ggttccaaga cactttcaag gcagagaaaa taggattaca 2280 aagaggaggt gccttggcag agggcagcac ccagctcagc ctcagagctg aaggtgaaga 2340 caagccagcg tgaaaccccg ggtctgccac gaatgcccgc tccgctggcc actcaccagc 2400 tgcctgccac aagccactgc agcttgagca gggtctgtgc cctctcagca cagagcccag 2460 ttcgctgcgt ggcctttggc ccccgccaga accttgcagg agccttaagg ttcgggccct 2520 agcccagcct gaccttacct gctgtgccct gcctgctggt caagtccagt cccaggagac 2580 cccatgcctt ggctcctagg ctgttccagg cacttccctg acctgccggg tgattgccca 2640 gctggaacct catccacacc ccagcaccaa ccacctcgtg ttggtaactg ctcgtgtctg 2700 tagtctgagt aggccatgtt gaggttcctc catctgcctg gtccattggt gttctgagac 2760 cagttccact gctgttctga cagatccccc accctgtgcc cctgccagcc cccacaggtt 2820 tatttttgca cataaaccat gacccatact aatttggcta gctctgggga ctagggagac 2880 cctggagatc tcaagagtgt ggctatcccc tattttcacc aagccttcaa tatccagcca 2940 ggccatctgg cccacaccat cttacctcaa agacagacat atatatatat atacatatat 3000 atgattttgt taataaaact atgaaattta aa 3032 60 474 DNA Rattus sp. 60 atgaccaagc atcacaaaca cttttatttg tggttatcag ttatttttac agaataaaga 60 actcattaat ttgtaacact gtacattaaa ttaaaatata acccatccct acatcaaaaa 120 ttatctaagt tgaccaagat aaaaaaagtc tctaaaagct tatatacatt agaagtagca 180 aaaataataa taaaggaaga aattagaaaa cagacatcaa agtcagacat ctagaagaat 240 tctccaacat ctgctctctt atctcggcat ttgcttcggg cttttgttcg agctttgaaa 300 gctgcagagt tatataaatg cctttcaaaa cgaggaatct tcatggtttt aagtgttgca 360 gcatcgagca tcacaggggg tcccagctca aatacattgc gaaggaattc atttgtctgc 420 aagtggtact gcatccctga tccaagagcc tctttaaacg tgtcataagt gtgc 474 61 1614 DNA Rattus norvegicus 61 tactctctcg ctctctttct gtctcttcct cgctccctct ctttctctcc tccctctgcc 60 ttcccagtgc ataaagtctc tgtcgctccc ggaacttgtt ggcaatgcct atttttcagc 120 tttcccccgc gttctctaaa ctaactattt aaaggtctgc ggtcgcaaat ggtttgacta 180 aacgtaggat gggacttaag ttgaacggca gatatatttc actgatcctc gcggtgcaaa 240 tagcttacct ggtgcaggcc gtgagagcag caggcaagtg cgatgcagtc tttaagggct 300 tttcagactg tttgctcaag ctgggtgaca gcatggccaa ctacccgcag ggcctggacg 360 acaagacgaa catcaagacc gtgtgcacat actgggagga tttccacagc tgcacggtca 420 cagctcttac ggattgccag gaaggggcga aagatatgtg ggataaactg agaaaagaat 480 cgaaaaacct caatatccaa ggcagcttat tcgaactctg cggcagcggc aacggggcgg 540 cggggtccct gctcccggcg ctttccgtgc tcctggtgtc tctctcggca gctttagcga 600 cctggctttc cttctgagca cggggccggg tcccccctcc gctcacccac ccacactcac 660 tccatgctcc cggaaatcga gaggaagagc cattcgttct ctaaggacgt tgtgattctc 720 tgtgatattg aaaacactca tatgggattg tgggaaatcc tgtttctctc tttttttttt 780 tttaattttt ttttattttg gttgagtcct tgtgttttag ttgccaaatg ttaccgatca 840 gtgagcaaag caagcacagc caaaatcgga cctcacctta agtccgtctt cacacaaaaa 900 taagaaaacg gcaaactcac ccccattttt aattttgttt ttaattttac ttacttattt 960 atttatttat tttttggcaa aagaatctca ggaatggccc tgggccacct actatattaa 1020 tcatgttgat aacatgaaaa atgatgggct cctcctaatg agaaagcgag gagaggagaa 1080 ggccagggga atgagctcaa gagtgatgcc cacgtgggga gcatctggtg aataatcgct 1140 cacgtctttc ttccacagta ccttgttttg atcatttcca cagcacattt ctcctccaga 1200 aacgcgaaaa acacaagcgt gtgggttctg catttttaag gataagagag agaaagaggt 1260 tgggtatagt aggacaggtt gtcagaagag atgctgctat ggtcacgagg ggccggtttc 1320 acctgctatt gtcgtcgcct ccttcagttc cactgccttt atgtcccctc ctctctcttg 1380 ttttagctgt tacacataca gtaatacctg aatatccaac ggtatagttc acaagggggt 1440 aatcaatgtt aaatctaaaa tagaatttaa aaaaaaaaga ttttgacata aaagagcctt 1500 gattttaaaa aaaaagagag agatgtaatt taaaaagttt attataaatt aaattcagca 1560 aaaatttgct acaaagtata gagaagtata aaataaaagt tattgtttga aaaa 1614 62 6075 DNA Rattus norvegicus misc_feature (1)..(6075) where n may be a or g or c or t/u, unknown, or other 62 agcgggtcgg tccacccgag aaaggggngc ggaggaactg ggaccgaccc gggaagactg 60 cgccagaggt gcacagagat cgcgcgggga gcgaggagcc gggcctcggc gctcagcccg 120 gatccacgtc ccgcgaagca ccggctccgg ccggccgcgg ggtcatgacg tacagcgagc 180 tctacagccg gtacacgagg gtctggatcc ccgacccaga tgaggtgtgg cgctcggctg 240 aactaaccaa ggactacaaa gatggggatg agagcctaca actcagactg gaagatgaca 300 caattctgga ttacccaatc gatgtccaaa ataaccaggt gccattcttg cggaatccag 360 atatcttagt aggagaaaac gacctcactg cactcagcca tctccacgag cccgcagttc 420 tgcataactt aaaggtccgt ttcctggagt ctaaccacat ctacacttac tgtggaattg 480 tgcttgttgc cattaatccc tatgagcagc tgccaatcta cgggcaggat gtcatctatg 540 cctacagtgg ccaaaatatg ggcgacatgg atccccacat ctttgctgta gcagaagaag 600 cctacaagca gatggccaga gatgaaaaga accaatccat catagtcagc ggagagtctg 660 gagcagggaa gaccgtgtca gccaagtatg ccatgcgcta ttttgccaca gttgggggct 720 cagccagtga taccaacatc gaagagaagg tcctggcttc cagtcccatc atggaggcca 780 tcgggaatgc caagacaact cgcaacgaca atagcagccg atttgggaag tacattgaga 840 tcggctttga taaaaagtac cacatcatcg gggccaacat gaggacctat ttgctggaga 900 agtccagggt ggtctttcag gcggatgatg agaggaacta ccacatcttt taccagctct 960 gtgctgcggc cagccttcct gagtttaagg agctcgcact aacatgtgca gaggactttt 1020 tctacaccgc ccatggagga aacacgacca tcgagggtgt agatgatgca gaggactttg 1080 agaagacaag acaagccctc acactccttg gagttcggga gtcccatcaa atcagcatct 1140 ttaagataat tgcttctatc ttgcaccttg gaagtgtgga gatccaggcc gagcgggatg 1200 gggactcctg cagtatatcg ccccaggacg aacacctgag caacttctgc cgcctgctag 1260 gaatagagca cagtcagatg gagcactggc tgtgtcatcg gaagctggtc accacctccg 1320 agacctacgt caagaccatg tccctgcagc aagtggtcaa cgcacgcaat gccttggcca 1380 agcacatcta tgcccagctg ttctcctgga ttgtggagca catcaacaag gccctgcaaa 1440 cctcccttaa acaacactcc ttcatcgggg tcctggatat ttatgggttt gagaccttcg 1500 agattaatag cttcgagcag ttttgtatca actatgccaa cgaaaagctc cagcaacagt 1560 tcaactcgca tgtgttcaag ctggagcaag aagagtatat gaaggagcag atcccgtgga 1620 ccttgattga cttctatgat aaccaaccgt gcatagacct catagaagcc aagctgggta 1680 tcctggacct gttggatgag gagtgtaagg tccccaaagg aactgatcag aactgggctc 1740 agaaactcta tgaacggcac tccaacagtc aacacttcca gaaaccacgc atgtccaaca 1800 cggccttcat tgtcatccac tttgcagaca aggtggaata cctttcagat ggttttctgg 1860 agaaaaacag ggacacggta tatgaagaac agatcaacat cctgaaagcg agcaagtttc 1920 cgctagtggc tgatttgttc cgtgatgacg aggactctgt tcctgccacc aacacagcta 1980 agagtcggtc atcttcaaag atcaatgttc gttcttccag acccctcatg aaggccccca 2040 acaaggagca caagaaatcc gtgggctacc agttccgcac ttccctaaac ctgcttatgg 2100 agactctgaa tgccacaacg ccccactacg ttcgatgcat caagcccaac gacgaaaagc 2160 tccccttcca cttcgaccca aagagagctg tgcagcagct cagagcctgc ggcgtgttgg 2220 agaccattcg gatcagcgcg gcaggctacc cgtccaggtg gacctaccat gacttcttca 2280 accggtatcg ggtgttgatg aagaagagag agcttgccaa caccaccgac aagaagaata 2340 tctgcaagtc tgtcctggag agtctcatca aggatccaga caagttccag tttggccgca 2400 ccaagatctt cttccgggca ggtcaggtgg cctacctgga gaagcttcgg gcagacaagt 2460 tccgggaggc caccatcatg atccaaaaga cggtcagggg ctggctgcag agagtgaagt 2520 accggaggct gagagcagct acgctaaccc tgcagagatt ctgccgagga tacttagccc 2580 gcagactgac tgagcacttg aggagaaccc gggcggccat agtgttccag aagcagtacc 2640 gcatgctgaa ggcccgccga gcctactgca gggtccgcag ggctgcggtc atcatccagt 2700 cctacacgag gggccatgtg tgtacgcaga agctaccgcc agtcctcacg gagcacaaag 2760 ccaccatcat ccagaagtac gcccggggct ggatggcacg gagacatttt cagcggcagc 2820 gggatgcagc cattgtcatc cagtgtgcct tccggaggct caaggccagg caggcactga 2880 aggccctcaa gatcgaggcc cgttctgcag agcatctgaa acgcctcaac gtgggcatgg 2940 agaacaaagt tgtccagctg cagcggaaga ttgatgacca gaacaaagag ttcaagactc 3000 tgtcagagca gttgtctgca gttacctcca cccatgccat ggaggtggag aagctgaaga 3060 aggagctggc gcgttaccag cagaaccagg aggctgaccc cagccttcag ctgcaggagg 3120 aggtacagag cctgcgcacc gaactacaga aggctcattc agagcgcagg gtcctagagg 3180 atgctcacaa cagggagaat ggtgaactga gaaagcgagt cgcagacctg gaacatgaaa 3240 atgcactctt gaaggatgag aaagaacacc ttaaccacca aatcctgcgc cagtcaaaag 3300 ctgaatcttc acagagctct gtggaggaaa acctgctgat taagaaggaa ctggaggagg 3360 aacggtcccg gtaccagaac ctcgtgaagg agtactccca gctggagcag agatatgaga 3420 accttcggga cgagcagcaa actccaggcc acaggaagaa cccatcaaat caaagtagct 3480 tagaatctga ctccaattac ccctccattt ccacctcaga aatcggagac accgaggatg 3540 ccctacagca ggtggaggag attggtatag agaaggcagc catggacatg actgtcttct 3600 tgaagctgca gaagagagtg cgggaacttg agcaggagag gaagaagctg caggtgcagc 3660 tagaaaagga acaacaggac agcaagaaag tgcaggtaga acaacaaaac aatggcctag 3720 atgtggacca ggatgcagat atagcctaca atagtctgaa gagacaagag ctcgagtcag 3780 agaacaagaa gctgaagaat gatctgaatg agcgctggaa ggctgtagcc gaccaagcca 3840 tgcaggataa ctccactcat agctcccccg acagctacag cctcctactg aaccagctca 3900 agctggccaa cgaggagctg gaggtccgca aagaggaggt gctgatcctc aggacccaga 3960 tcatgaatgc tgaccagcgc agactgtccg gcaagaacat ggagccgaac atcaatgcca 4020 gaacaagttg gcccaacagc gagaagcacg tggaccagga agatgccatt gaggcctatc 4080 acggggtctg ccagacaaac agccagactg aggattgggg atatttgaat gaagatggag 4140 aactcggctt ggcttaccaa ggcctaaagc aagtcgccag gttgctggag gcccagctgc 4200 aggcccagaa cctgaagcat gaggaggagg tggagcatct caaggcccag gtggaggcca 4260 tgaaggagga gatggacaaa cagcagcaga ccttctgcca gactctgctg ctctccccag 4320 aggcccaggt agaatttggt gtccagcagg agatatcccg gctgaccaat gagaacctgg 4380 actttaagga gttggtggaa aagctggaga agaatgagaa gaagctgaaa aagcagctga 4440 agatttacat gaagaaggtc caggacttag aagctgccca ggcattggca cagagtgaca 4500 ggaggcacca tgaactcaca agacaggtca cggtccaacg aaaagagaag gacttccaag 4560 gcatgctgga gtaccacaaa gaggacgagg cactccttat ccggaacctg gtgacagacc 4620 tgaagcccca gatgctgtcg ggcaccgtgc cctgtctgcc tgcatacata ctctacatgt 4680 gcatcaggca cgcggactac accaacgatg acctcaaggt gcactcgttg ctgagctcca 4740 ccatcaatgg cattaagaaa gtcctcaaga agcacaatga ggactttgag atgacgtcat 4800 tctggttatc caacacctgc cgcctccttc actgtttgaa gcagtacagt ggggatgagg 4860 gtttcatgac acagaacacg gcgaagcaga atgagcactg tctcaagaac tttgacctca 4920 ctgaataccg tcaggtgcta agtgaccttt ccattcagat ctatcagcaa ctcattaaaa 4980 ttgctgaggg cctgctacag cctatgatag tttctgccat gctggaaaat gagagtatcc 5040 aggggctgtc aggggtgagg ccaactggct accggaagcg ctcgtccagc atggtggacg 5100 gagaaaattc gtattgcctg gaggccatca tccgccagat gaatttcttt catacagtcc 5160 tgtgtgacca gggcctggac cccgagatta tcctgcaggt gttcaaacag cttttctaca 5220 tgatcaatgc ggtgactctt aacaacctac tcctgcggaa agatgcctgc tcctggagca 5280 ccggcatgca actcaggtac aacataagtc aactcgaaga gtggcttcgg ggaaaaaacc 5340 ttcagcagag tggagcggtt cagaccatgg agcccttgat ccaggcagcc caactccttc 5400 agctgaagaa gaagacccag gaggatgctg aggccatctg ctctctgtgt acctccctca 5460 gcacccagca gattgtcaaa attttaaacc tctacactcc cttgaatgga tttgaagaac 5520 gggtgacagt gtcctttata cgaacaatcc aggctcagct acaagagagg agtgaccccc 5580 agcagctact gctggactcc aaacacatgt ttcctgttct gtttccattt aatccatctg 5640 ctctgaccat ggactcaatc cacatcccgg cctgtctcaa tctggagttc ctcaatgaag 5700 tctgaggatg cgttgtttcc aaggcaagcg agaaggaagt gcgtgctgtc ggctgaagga 5760 gtgctaggtc tgttaaatat gcccagcgta gatcaaacca tgttagagac ctgtggggag 5820 cactgaacta aacagcgggg tgcactctgg tcgttagctt ttgtacagac tgctcagaaa 5880 acacctgaag tgaggacgat tgttgcagtg ggactttcag gttaaaccaa gacacgtcag 5940 aacggacggc cgctgtgtag ctccagtcac catacaaaga tgccagttct acagagtgga 6000 agtgcctagc tttgagctgt gtatataact taagaatgtt caaactaaga ttatattaaa 6060 aacacatgac ataaa 6075 63 474 DNA Rattus norvegicus 63 attatgtgcc gaaaggggaa ggttggaccg cctggcctgg acatcggaag gaaggaagat 60 cagctgattg tccacatatt tcaccctaag gtcaatgtga gtcaggaaac catgtttggt 120 gacggaaata cctgttacac attcgactac actgtgtttg tgaaacatta caggagtggg 180 gagatcctac atacagaaca tagcgtccta aaagaagatt gtagcgaaac tctgtgtgag 240 ttaaacatct cagtgtccac gctgaattcc aattactgtg tttcagtagt tggaaagtcg 300 tctttctggc aagttaatac agaaacatca aaagacgtct gtatcccctt tctccatgat 360 gacagagaag aatcaatttg gatgctgctc gttgctcctc ttctcttcct tacaatagtt 420 gtcccggcac ttgtgtgttg ttacattaag aagaatccat ttaagagaaa aagc 474 64 5028 DNA Rattus norvegicus 64 gcctggcagg cgggagaacg ctccggagtt gtggccgtgg gcaccgggct cgcggcaaga 60 ggagcggaga gcgggcatct cctgagcgcc gtcatggctg cttaggctgc gcctgccagc 120 ggaccgacgg tgtcgcccga atccggctcg gataggtctg gttggagtct gtgcctgctt 180 gcttggcgtg tggttgttcc tgcttgattg gcacggtgcc attggcttcg tatttgggaa 240 tcggaggagt taatcttgtc tcttctcaca ggttcgagtc ctcagacctt ctgcaggact 300 ccatccatat ctgcctcgca gctgactctc ctgctcacac agaagacggc catcctagat 360 ccccagctat tgtgctgacc atccccttcc tgctccggat ctcgcctggc tgctaggctg 420 tggtgctgcc ttttcagagt caggctgtag cgactccccg ccttcgtccc ggctgggctt 480 aggtggaaca gtggttcatc tcatctcatc agcacttctg aagaagaaag tgtgagaagc 540 agaggccatg gctccttttc gctgtcaaaa atgcggcaag tccttcctca ccctggagaa 600 gttcaccatc cacaattatt cccacaccag ggagcgccca ttcaagtgct ccaagactga 660 gtgtggcaaa gccttcgtct ccaagtataa gctgatgaga cacatggcta cgcactctcc 720 ccagaagacg caccagtgca ctcattgtga aaagactttc aaccggaagg atcatctgaa 780 gaatcacctc cagacccacg atcccaacaa gatgatctac gcctgcgaag attgtggcaa 840 gaaataccac accatgctgg gctacaagag gcacatggcc ctgcattcgg ccagcagcgg 900 cgatctcacc tgcggcgtct gcaccctgga gctggggagc accgaggtcc tgctggacca 960 cctcaagtct cacgcggaag aaaaggccca ccacgcgccc agggagaaga aacaccagtg 1020 cgaccactgc gagagatgct tctacacccg gaaggatgtg cgtcgccacc tggtggtcca 1080 cacaggatgc aaggacttcc tgtgtcagtt ctgcgcccag agatttgggc gcaaagacca 1140 cctcactcgt cacaccaaga agacccactc ccaggagctg atgcaagaga gcctgcaagc 1200 aggagaatac cagggcggtt accaacccat tgcgcctccg ttccagatca aggctgatcc 1260 catgcctcct ttccagttag aaatgccccc cgagagcggg cttgatgggg gcttgcctcc 1320 tgagattcat ggtctagtgc ttgcttcccc agaggaggtt ccccagccta tgctgtctat 1380 gccgccaatg cagccaatgc cagagcagcc tttcactctg caccctgggg tagttccctc 1440 ctctcctccc ccgatcattc ttcaggagca taagtacagc ccagttccta cctcttttgc 1500 cccgttcgta agcatgccga tgaaagcaga tctcaagggc ttttgcaaca tgggtctctt 1560 tgaggaattt cctctgcaag agtgtcagtc gcctgtcaag ttcagtcagt gctttgagat 1620 ggctaaggaa gggtttggga aagtcaccct gcccaaagag ctgctggtag atgctgtaaa 1680 tatagccatt cctggctctc tggagatttc ctctctcttg gggttctggc agctgccccc 1740 tcctcctccc cagaatggct tcatgaatgg caccatccct gtgggggccg gggagccgct 1800 gccccatagg ataacttgtc tggcacagca gcagccacca cctctgctac ctccgccgcc 1860 gccgctgccg ctgccagagc cgctgccaca gccacagctg ccgccacagt ttcagttgca 1920 gctccagccc cagccccaga tgcagcccca gatgcagctg cagcctctac agctgcagct 1980 gccccagctg ctgccccagc tgcagcccga gcctgagcca gagccagagc cagaggaaga 2040 agaggaagaa gaagaagaga tagaagaaga agaagagatc gaagaagaag aagaagccga 2100 accagaagca gaagaagaag aggaggcaga agacgaagag gaggcagagg aagaggaaga 2160 agagccacag ccagaagaag cccaaatagc aatgagcgct gtgaatatgg gccagccccc 2220 gctacccccg acccctcatg ttttcacagc tggcaccaac actgctatcc tgccccattt 2280 ccaccacgcg ttcagataaa ttggtttttt aagagggtgc ttctcttctg gaagatgttt 2340 caaacaccag ttccagttcc agacatcagt tacagtttga agagaagcgt tggaaaaaca 2400 ggaatggggt ttctagctta ttgccatgag tagattgaga aaaagaactc tcttaactgc 2460 atgcactgtg ccaatacata tatatatata tatatatata tatatatgta tatatatata 2520 tatatatatc atccttagta ttcatgcttt gtaccaaact tagtgagtgc gggcgttctc 2580 cgtaatcgaa ctgcaagtag tatcatatta ttaccctgat attgttagtc tcatattatt 2640 agccttgtat tattctcata taatcaaaac caagatccaa aacatgagct gctaatttgt 2700 aaatatcgtg ttgagtgtta gccgtcgtag tgatgttagc tgcgtagttg cgtgttagca 2760 ctgcctagga agggcacgag ggccaagttg ggcttctccc acttggaaga tgttttgaag 2820 agaagggggt gatctccgta gggcgtccgt aactaggccg tgtgttcttt tcagggaccc 2880 gtctaccttc aggattggat gtagtttagt cgctcttctt cttagctcgc tttgtagttt 2940 gtccttctgg tagcctactg tgtgtgtctg tgtgtagctt tataggaaag ttccgtgtga 3000 agctgtcggt gtcttcgttt tcaaaagtga attttaaatg tatttttcaa tatttttcat 3060 gtgatgttgt accaatgtga attatgactt cgtttatctt aaagacaaaa ctggttgtca 3120 gtcatatctg acaggaagaa agaaatccct gtgggtaggc aagtcaagtg gccaactaat 3180 gagaagaagc atcaatcgaa agtgttggct gactgggaca ctcatgattc tcacaggact 3240 ttgagaaacg tactggaatt aaaaaaaaaa aagcttaagt acattagata agaattttct 3300 ttgcctagct taacctacta cttaagcctc ttaagttctg aagtattgtg atcaaccaat 3360 aggaaaatgt atctgtagtt gatgaatttc agtccttgtt actttgtatc ccaagaggtt 3420 tgtgttttgg gaatgtaacc gtacttgtaa tctcagttgg tatcttgcta atcgatttga 3480 aagtgtaaaa cctaaccctt gaagactctg tatttccttt tttgagactg tatttcccag 3540 catgtatacc ctaacctttg gagactctgt attctgtttt tgagactttc cccccgcccc 3600 ccagcatatg taccccgacc cttgaagact gtatttcgtt tttgagagcg tatttcccag 3660 catatataca ctaacccttg aagactctgt atttcctttt ttgagactgt atttcccagc 3720 atatatacac taacccttga agactctgta tttccttttt tgagactgta tttcccagca 3780 tatatacact aacctttgaa gactctgtat tctgtttttg agaccccccc ccagcatatg 3840 taccctaacc cttgaagact gtatttcgtt tttgagaacg tatttcccag catatataca 3900 ctaacctttg gaagactctg tatttcattt ttgagactgt gtttcttagt atacataccc 3960 taacctttga aagactccat ttttgagact tccccccccc cagcatttgt gccctaaccc 4020 ttggaggctt tgtatttttt ttttgagact tttccgccag catatataca ctaacccttg 4080 aagactctgt atttcatttt tgagactttt ttccccagca tatataccgt aacccttgaa 4140 gactctgtat tccgtttttg agattttttt ccctcagcat atatacccca acctttgaag 4200 actctgtatt tcatttttga gactttttcc cagcatatat accctaacct ttgaagactc 4260 tgtattccat ttttgagatt ttttccctca gcatatatac cctaaccttt gaagactctg 4320 tatttcgttt ttgagatttt ttcccccagc atataaacac taacctttga agactctgta 4380 tttcattttt gagacttttt tcccagcata tataccctaa cccttgaaga ctctgtaatc 4440 tgtttttttt tttttttgag actttttccc ccagcatata tacactaacc tttgaagact 4500 ctgtattcca ttttttgaga cttttttccc cagcatatat accctaacct ttgaagactc 4560 tgtatttcat ttttgagact ttttccccag catatatacc ctaacctttg aagactctgt 4620 attccgtttt tgagaccccc cccccggcat gaatacccta atctttgaag actctggtat 4680 ttcatttttg agattttttt cccctcagca tatatacact aacctttgta gactctgtat 4740 tccgtttttg agactttccc cccccagcat gtatacccta acctttgaag actctgtatt 4800 tccagcattt gtaccctacc cttgaagact ctgtatttcc cagcatttgt accctaaccc 4860 ttgaagaccc tgtatttcgt ttgtaagact tttccccagc atatatatcc tacatataat 4920 aaacgctaag catctagcaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4980 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa 5028 65 2648 DNA Rattus norvegicus 65 gctggggaga cccagagcgc cgcgctgtct gctgggctca cggagacgcg gaccaccttt 60 ggtcgccccg cgccgtcctc tccgcgccgc ctggcccgcc gcctcgagcg ccctcagtcc 120 ctcaagccag atgatgaact tcctgaggcg ccggctgtcg gacagcagct tcattgccaa 180 cctgcccaat ggctacatga ctgaccttca gcgcccggag ccgcagcagc ccccacccgc 240 ccctggcccc ggtaccgcta ccgcgtcggc ggccacctca gctgcctcgc ctggccccga 300 gcgcaggccg ccacccgccc aggcacccgc gccgcagccg gcgccgcagc ccgcgccgac 360 gccgtcggtg ggcagcagct tcttcagctc gctgtcgcaa gccgtgaagc agacggccgc 420 ctccgccggc ctggtggacg cgcccgcgcc ttcggccgct tccaggaagg ccaaggtgct 480 gctggtggtc gacgagccgc acaccgactg ggccaagtgc tttcggggca aaaaaatcct 540 tggagattat gacatcaaag tagaacaggc agaattttca gagcttaacc tggtggccca 600 tgcggatgga acctacgccg tggacatgca ggtactccgg aatggcacaa aggttgtcag 660 gtccttcagg ccagacttcg tgctcatccg acagcatgca tttggcatgg cggagaatga 720 agacttccgc cacctggtca ttggcatgca gtatgcaggc ctccccagca tcaactcact 780 ggagtccatt tacaacttct gtgacaagcc atgggtgttt gctcagatgg tggccatctt 840 caagacactg ggaggggaga aatttccact cattgagcag acctactacc ccaatcaccg 900 agagatgctt acacttccta cgtttcctgt cgtggtgaag attggccatg ctcactcggg 960 catgggcaag gtcaaagtgg aaaatcacta cgacttccag gacattgcca gtgtggtggc 1020 cctcacccaa acctatgcca cagcagaacc cttcattgac gccaagtatg acatccgggt 1080 tcagaagatt ggcaacaact acaaggctta catgaggaca tccatctcag ggaactggaa 1140 gacaaacact ggctccgcga tgctggagca gatcgccatg tcagaccggt acaagctctg 1200 ggtcgacgcc tgctctgaga tgtttggggg cttggacatc tgtgcagtca aagctgtgca 1260 tggcaaagat ggcaaagact acatttttga ggtcatggac tgtagcatgc cactgattgg 1320 ggaacaccaa gtggaggaca gacaactcat cactgatcta gtcatcagca agatgaacca 1380 gctattgtcc aggactcctg ccctgtctcc acagagacct ttaaccaccc aacagccaca 1440 gagtggaaca cttaaggaac cggactccag caagacccct cctcagcggc cagcccccca 1500 agggggccct gggcaacccc aaggaatgca gcccccaggc aaggtgctgc ctccacgtcg 1560 gctcccctcg ggaccatcac tgccaccttc ttcctcttcc tcctcctctt cctcctcctc 1620 ttcctcggct cctcagaggc cgggcggccc tacctccacc caggtgaatg catcctccag 1680 cagcaactcc ctggcagagc cccaggcacc tcaggctgct ccaccacaga agccccagcc 1740 tcacccacag ctcaacaagt cgcagtccct gacaaatgcg ttcagcttct ctgagtcctc 1800 cttcttccgg tcttcagcca atgaagatga agccaaagcg gagaccatcc ggagcttgag 1860 gaagtccttt gccagcctct tttcagatta gctcttcaga tacacgaggg catccggccc 1920 aaccaggaaa ggcatctaag acattcatca acagcagtcc gccatgtttg gtggcaatgt 1980 cccatgactt ggacatgtgt ggccccttct tctgttcctt tgtgctcagt aatttgtgca 2040 gcccagaaat gaccatgtga tggtctcagg gcaggtggct accctgagaa agacctgaca 2100 gtagaggaaa gggctgtttc cctgtagtct taagagcttt attctgaagt catcgttgct 2160 gtgaattcag tggccttatt gtgacagagc catcgtgtct tactcttccc tgtgaagcca 2220 ggatctctgt aacgtaggac tcctggcagc ttacctatct cagcatatcc tagtagtcag 2280 tgctacctgt gggaaatata ttcttacctt agaagtattg ttggctctaa gttttaggag 2340 cttctcccca aagctgggtc tcctgtggtc cctgttgaaa gcacattcac ctgaccctgc 2400 actccctaga ctgctgtgga ggtcaagttc cttgtgactg tttgaaacaa gagaacaagg 2460 atccacagtg tctcctttct cccctctgga gagcagttct gttgtgtgag catcagtaac 2520 cttgaccttt cgaagctgta ctgtagaaga gaatgcatgc cgctataaca gcagtccacc 2580 aacctttgtg ttcctcacct tatgaatgtc aagaccttgg tgtaataaaa gcagcagctt 2640 taaccagc 2648 66 598 DNA Rattus sp. 66 cataatactg tcattacaaa aaaatacaaa aaactactat aaaaacattc aggggcttgt 60 caaagtgaga aaacctaaag accccacccc aggaccaact gaagcagttt tctcccagct 120 ccttcactat tcgactttta tacaactgtg ggagtggggt ggggtgaggg tcacacaggc 180 agagaggctg gaaatttccg acacagcctc caagaaaagt aagaaataag tagcttcaca 240 tatcgcaaaa gtggggcttg gaagtttggg gtggctaggc cctgagttca gagatatggg 300 aaggaacctg tgatcctgaa tctcttggtg gggaacagct gccacctgat cccaaagccc 360 ttttccttcc tggtgatggc tgggagatgg gtcctgcccc acctcttagc tgtggtggat 420 ggttcaagcc ctcctttctt ctctccaccc ctgaacactc cccagagcag ctgaggggga 480 cagatcctcc aactctgtgg ggtgtaggca aggcagctgt tgggaaggct cctaagggtt 540 ggctcaactc cctgccccaa atgaaaagtg tctcgtgctc aaatgacagc cttgttca 598 67 3105 DNA Rattus sp. 67 aaattaaatc taaggacttt cagatttatg gctttgatca cactgtttct agagaaatct 60 aaacctggaa ggctgagtta agccagacat tccagatggc tctctcctca tagtccttgg 120 aatcacgaag gaagcagggc agagagctac cagaagtagt aaacattgat cacaggctcc 180 tagttcatcg tgaccaaatc aaaaggaatg tttctccatg gcccattaac tgtctgttag 240 tttgaacgta acatggtgat agccagactg gagctacctg agtcctgttc cagggaatct 300 tagggcaatt acctacataa cccttctgga cctcaactgc ctgatcttag ggattaataa 360 catctattta ccagagcgac tgcattgtga agggttccaa acactcctgg cacagagtaa 420 gcactgtctg ggctttggat agaaatctct tctgcaccat gagctcattt ataagacttt 480 ccaggtctgg aattgtacaa cccaaacagc tcatatcaat gtcacaagct cttcggtttg 540 gcaaaatgtc tgggagtcac caaatgcaga gaatgccata ttcaacaaag cctgataacc 600 aaggactcag tggactaatt ggcagtccta tcccagatcc aaggttcctt gagccagggg 660 caagctagga tatgctccca ggtatcttct cccttaggac tttaggtttc ttggccactt 720 cctcttattt cagtgaaagc agatccactc cattgacact tgtggtcaca gtctagcacg 780 actgctccct tccttctttt ctccctccct gcgcagcttc atttgctccc agtagtggct 840 ggaaaaacac caaattccaa tccgcggttt ctcccttcta cttcctggaa acatccaagg 900 gctcggcact tactcagcag attcaaacct tccactttcc atcactcatc gaggatgatg 960 ctgctccttg gcaccaacca ccctgcctga ctccaccctc tggcttacaa taaaaggctg 1020 aggcagagcc gctagaaatg cagagacaca gacagaggcc agcccagaaa ccagccaact 1080 ctcactgaag ccagatctct cttcctccac cactatgcag gtctctgtca cgcttctggg 1140 cctgttgttc acagttgctg cctgtagcat ccacgtgctg tctcagccag gtgagacccc 1200 agtttccttc tccttctagc atttcacccc attttttaat tgttgtgggc catcatagtg 1260 ggccttacct agtaaaatac tttttttttt ttaccaaggt aaggagcata gagccaaccc 1320 aattacaggg gttgcttctg gaaagcaact aggattttaa tcgttagatc aaagtttaga 1380 atcgcacctt catacagttc ctgctcccct atttcctgag tatttgagaa cctggttgat 1440 caaagaaggg cttgggttgg ttcatttttc cagatagagg agaatcagga agagacccag 1500 gatcttgatc tatgtttcac cagcttccag agatagcagc tcagcagagg tagttggtat 1560 cagagatact catgattcga tatagggttt ttttttgtaa cctatagtaa tgtactcggt 1620 aatcttctca gaccctagta atttgacttc taactaccct caaatgacag tccctagctt 1680 taatggcatc cctctgtcca agattgtgaa cttactttaa gtgtgtcaga gatcaccttc 1740 cagctctgat gtattggcat ttacatccca atctgctgaa actgccttct cctcatggtc 1800 cttttcttct ctaaggtcag aagcaccttt ccagttctaa tgtgctccct gcttctcttt 1860 tattctccag atgcagttaa tgccccactc acctgctgct actcattcac tggcaagatg 1920 atcccaatga gtcggctgga gaactacaag agaatcacca gcagcaggtg tcccaaagaa 1980 gctgtagtgt gagttataca ccccagccct ccctggtcca atatttttcc tcgagaacaa 2040 gggatggtct tcatagactt agaatcagtt acatgctcag ctccaatatc aagtggttcc 2100 caatggggaa actgaggcca agaagggaaa gttaattctc agcagcactg tctctatggc 2160 tgctgttcgg ggccttccat ttgcatgagc ttattgtagt aaacttgcag aagaggaagg 2220 tcactttgag tccccctttc tacctgccct cccacctcga gccctacaca gtccctccat 2280 gtatagcagg ttaaacttca tctaaccgtg tcttctctct ttccacagat ttgtcaccaa 2340 gctcaagaga gagatctgtg ctgaccccaa taaggaatgg gtccagaagt acattagaaa 2400 actggaccag aaccaagtga gatcagaaac tacagtcttc tataaaattg catcaaccct 2460 aaggacttca gcacctttga atgtgaactt gacccataaa tctgaagcta atgcatccac 2520 tctcttttcc acaaccacct caagcacttc tgtagaagtg accagtatga cagagaacta 2580 gtgtgatttg gaatgtgatg ccttaagtaa tgttaaactt atttaactta ttgatattac 2640 actattccct tccatgaata ctagaaatcc ttaaatgcaa gatgtagatc cattttttta 2700 tttctctgtg aatcctggtt caacactttc aatgtatgag agatgaatgg gtaaactttg 2760 tgtttgagag tccaaggtat tgtttaaaat attattatgg atattcctaa ttattaaaag 2820 aaatatatta tttttgtaca caagtctgac tttcggtgtt ttctgaggga aatggcaaag 2880 ctaagagtac ataagaacac acaggaggac atcacaagat gggacacata ttgagggggg 2940 gatgggggaa tgaatgctgc actcttttgt attgagtggt ctcatgtgag tgtcataaac 3000 tctttgagac agggtccagt cagggatgct agtaccatag ttccaatccc caggactgct 3060 tctcagacac atgctcgata aaagccccag tccttcccag tcatg 3105 68 2619 DNA Rattus norvegicus misc_feature (1)..(2619) where n may be a or g or c or t/u, unknown, or other 68 ggcaccaggc tcgcctcacg ccggcggcgc ggactgggaa gcggcgccct cggctgcgct 60 cggttcggaa cccgctctgt ggagtcagag gcagcgacgg gcgccaggcc gcccgggccg 120 gcggaggcgg ccgctgaggg gagtgtgagg aggcagcggc ggccctggga atgtgaggcg 180 gcgcggagcg ggagagaaga gccggggtgg gggcacgagc cgcggctgcc gtgcctcaga 240 cgagccgcga gcctctcgct tcagcgcgcg accgccggga gcttgggggc tgggtgcggg 300 ggaccgcgcg gggctgggtt ctctgattca ttcattctcc gcggcccgga gcccgaggcc 360 gcgggtgctg cgaggagagg cgggaagagg gggcggccgc gagcggggct cctttgtgct 420 cggcgggggc cgggaagccg tgggaggtgg gctcgggccc cctgcgctgc tccccggcgg 480 ccgctgcgcc cccagctagc cgccagcctg gaaatggctc cgctgatgct cctcgtgaga 540 acgaatcgat ccttcccagc cttctctgcc tgctctccac ctcctctctg ctccgagtct 600 taggagaacg aacattcaaa ggacagattc caatgtggtg tgctgtgcac atcgcgagcg 660 gctggggttt gcacttcgag atttcttctt tataattttt tttttttaat gtaagggaga 720 cagtggaatt gctacccgta gaatttttat tcaagtgcac gtcgcgttgg gttgcacgct 780 ccacccccag ggacctggtg tggtgaaatt tgaacccacc gccttagccc aaaggccgag 840 taacctggct gcttgagtgt cgtggaagac gtgagcgaaa tgatcagcga actcattttt 900 tatcagactc actgaagctg gcttttgcgt ttttctacac gtacactaat tttatggaat 960 agttaaagtg ctatattctc cgcgcaacct tttcaaattc caaatgtttg aacgttttgg 1020 tgtcagcgcg agtgaaatca ttttaccgac aagaactaac tgaattgtct gcctcgttga 1080 gttgcctccg gaaaagatct cgggggtgga aaagcaactg caaaataaca gacggagaaa 1140 attccttgga agttatttct gtagcataag agcagaaact tcagagcaag ttttcattgg 1200 gcaaaatggg ggaacaacct atcttcagca ctcgagctca tgtcttccag atcgacccaa 1260 acacaaagaa gaactgggta cccaccagca agcatgcagt tactgtgtct tatttctatg 1320 acagcacaag gaatgtgtat aggataatca gtctanacgg ctcaaaggca ataataaata 1380 gcaccatcac tccaaacatg acatttacta aaacatctca aaagtttggc caatgggctg 1440 atagccgggc aaacactgtt tatggactgg gattctcctc tgagcatcat ctctcaaaat 1500 ttgcagaaaa gtttcaggaa tttaaagaag ctgctcggct ggcaaaggag aagtcgcagg 1560 agaagatgga actgaccagt accccttcac aggaatcagc aggaggagat cttcagtctc 1620 ctttaacacc agaaagtatc aatgggacag atgatgagag aacacccgat gtgacacaga 1680 actcagagcc aagggctgag ccagctcaga atgcattgcc attttcacat aggcagccaa 1740 ttttctttgc tggagaactt ctctggcctt ggttattttt gaaattagtg aagtcaccat 1800 ttcccttgga ggaagaggat ctgcagttta ttctgattga aaagatacaa atgacccctg 1860 actgagagaa acccctggaa gaaaggagac agcctaaaaa gacaactaat gtcgtgcact 1920 tttaacattg aagaatgaaa agaagggagt tctttttaaa gaagacacct gcttgaattc 1980 tgggtgagga ggacggctaa actagagcca tctagagaaa gaaaagatgg atttgggaag 2040 ttttcatgct ttagaactta ggagagtatc atggtccttg cttcactgat atccagaaga 2100 ctgagaagcc cttgatgaag ggaggaaaat ctgctggaaa tcaaagtaac tttgcgatga 2160 agaaaagtaa aaaatccaag tggctggctt tcttggactt ctaactagac tggttcatgg 2220 attttatttc ttttgaaact taacaaacta ggatctcttg gctatgtaat cttggcacat 2280 gtaactgccc aagttcatgt attgtctgtc atgaatcctc cgccctacct cacagggatg 2340 ttgtgaggcc catgttaatg tctgtaagta gtaattcatg cttagctctt tgttgatagt 2400 ctctgtgtct ttttaaaatt ataagtgctt tattacacct tcaaacaaac attaggtaat 2460 attttttaaa tggaagataa tggcattgta tgtggttcat cactgagcat ccttgtatga 2520 agaataattt ggctgcatga gatgttaagt tgattctttt acgattgggc ctttatatga 2580 tagtaataat aataaaagct atgctaagaa aaaaaaaaa 2619 69 1057 DNA Rattus norvegicus 69 cggcaacaga cgattgttct cagagccact ttgatggcag ctatgaggct caccctgttc 60 cgcattgtgt gtctgctgcc aggctgcctg gccctgccac tgtcccagga agccggagaa 120 gtgaccgcac ttcagtggga acaggcgcag aattatctta ggaaatttta ccttcacgac 180 tctaaaacaa agaaggccac cagtgcagtg gacaaactga gggaaatgca gaagttcttc 240 ggtttgccgg agactggaaa gctgtccccc cgtgtcatgg agataatgca gaagcccagg 300 tgtggagtgc cagatgttgc agaattctca ctaatgccaa acagtcctaa gtggcattcc 360 agaactgtca cctacagaat cgtgtcctat actacagact tgcctcggtt cttagtagat 420 caaatcgtga aaagagctct cagaatgtgg agtatgcaaa tcccactgaa cttcaagagg 480 gttagttggg ggactgcaga catcataatt ggcttcgcaa ggggagatca cggagacaac 540 ttcccatttg atgggccagg aaacactcta ggccatgcct ttgcaccggg gccaggcctc 600 ggcggagatg ctcactttga caaggatgag tactggacgg atggtgagga ctcaggagtg 660 aacttcctgt ttgttgccac tcatgaactt ggccactctc tgggtctggg tcactcttct 720 gttcccagtt ctgtgatgta ccctacctat caaggagatc attcagaaga cttcagtctt 780 acaaaggacg acattgcagg catccagaag ttatatggaa agaggaacaa gctgtgatag 840 atgcagacag tttctggaat gagcaaacgc ccttcctgag ccacacttac tcctttcttc 900 cttgtactgt ggatgggttt tgcacatccc tctgagggtc attttgatgg aatgagtctg 960 acaaatctca ggtaacacga cagacaccag caataaatgt catgtgacat cagcaataaa 1020 tgtcatgtgt gcaaataaaa aaaaaaaaaa aaaaccg 1057 70 1912 DNA Rattus sp. 70 cagattagga tcagcgagca cttgaggact tagggccaca aaaaaccgca caagatcgac 60 agactatttc tggagagctg cagaacgggc acgctggggt cgctggtgct ggccatggtg 120 atggaggtgg gcatcctgga cgccgggggg ctgcgcgcgc tgctgcgaga gcgcgccgct 180 cagtgcctgc ttctggattg tcgctccttc ttcgccttca acgccggcca catcgtgggc 240 tcagtgaacg tgcgcttcag caccatcgtg cggcgccgcg ccaagggcgc catgggcctg 300 gagcatatcg tgccgaacac cgaactgcgc ggccgcctgc tggccggagc ctatcacgcc 360 gtagtgctgt tggacgaacg cagcgccgcc ctggacggcg ccaagcgcga cggaaccctg 420 gccctggccg cgggcgcgct ctgccgagaa cgccgctcca ctcaagtctt ctccctccaa 480 ggaggatatg aagcgttttc ggcttcctgc cctgagctgt gcagcaaaca gtccaccccc 540 atggggctca gcctcccgct gagtactagt gtgcctgaca gtgcagaatc cggatgcagc 600 tcctgtagta cccctctcta cgaccagggg ggcccagtgg agatcctgtc cttcctgtac 660 ctgggcagtg cttaccatgc ttcccggaaa gatatgctcg acgccttggg tatcactgct 720 ttgatcaacg tctcggccaa ttgtcctaac cactttgagg gtcactacca gtacaagagc 780 atccctgtgg aggacaacca caaggcagac attagctcct ggttcaacga ggcgattgac 840 tttatagact ccatcaagga tgctggagga agggtgtttg tgcactgcca ggccggcatc 900 tccaggtcag ccaccatctg ccttgcttac ctcatgagga ctaaccgagt gaagctggac 960 gaggcctttg agttcgtgaa gcagaggcgg agtattatct cccccaactt cagcttcatg 1020 ggccagctgc tgcaatttga gtcccaagta ctggcccctc actgttctgc agaagctggg 1080 agcccggcca tggctgtcct tgaccggggc acctctacta caacggtctt caacttccct 1140 atctccatcc ctgttcaccc cacgaacagt gccctgaact accttcaaag ccccatcaca 1200 acctctccga gctgctgaag ggccagggga ggtgtggagt ttcacgtgcc accgggacga 1260 cactcctccc atgggaggag caatgcaata actctgggag aggctcatgt gagctggtcc 1320 ttatttattt aacacccccc ccccaaacac ctcccgagtt ccactgagtt cccaagcagt 1380 cataacaatg acttgaccgc aagacatttg ctgaactcag cccgttcggg accaatatat 1440 tgtgggtaca tcgagcccct ctgacaaaac agggcagaag ggaaaggact ctgtttgagc 1500 cagtttcttc ccttgcctgt tttttctaga aacttcgtgc ttgacatacc taccagtatt 1560 aaccattccc gatgacatac acgtttgaga gttttacctt atttatttgt gtgggtgggt 1620 ggtctgccct cacaaatgtg attgtctact catagaacaa cgaaatacct cactttgtgt 1680 gtttgcgtac tgtactatct tgtaaataga cccagagcag gctttcagca ctgatggacg 1740 aagccagtgt tggtttgttt gtagctttta gctatcaaca gttgtatgtt tgtttattta 1800 tgatctgaag taatatattt cttcttctga gaagacattt tgttactagg atgacttttt 1860 ttttatacag cagaataaat tatgacattt ctattgaaaa aaaaaaaaaa aa 1912 71 4665 DNA Rattus norvegicus 71 attgcttgct tcctaggggc cagggctgtt ctgagcacgt cacaccaatc caagcttcac 60 aaacacctat gggcagttga agagggggag gcctcggact tctcgctggc ctgggattcc 120 tctgtggcag cagcgggagg cctagaagga gagtcagagt gtgatcggaa atccagccgt 180 gcgctggaag acaggaacag cgtgacaagt caagaggaga gaaacgagga cgatgaagat 240 gtggaagatg agtcaattta cacctgcgat cactgtcagc aggacttcga gtctctggca 300 gacctgacgg accaccgggc ccaccgctgt cctggagatg gtgatgacga cccacagctc 360 tcctgggtgg cttcatctcc ctccagcaag gatgttgcgt cacccacgca gatgatcggc 420 gatggttgtg accttggcct cggcgaggag gaaggcggca ccggcctgcc gtacccttgc 480 cagttctgcg acaagtcctt catccgcctg agctacttga agaggcatga acagatccac 540 agcgacaagc tgccgttcaa gtgcaccttc tgcagccgcc tcttcaaaca caagaggagc 600 cgggaccggc acatcaagct gcacacaggc gacaagaagt accactgcca cgagtgcgag 660 gcggctttct cccgcaggga ccacctcaag atccacctga agacccacag ctccagcaag 720 ccgttcaagt gcagcgtgtg caaacgcggg ttctcctcca ccagctccct gcagagccac 780 atgcaggccc acaagaagaa taaggaacac ctggctaagt cagagaagga agccaagaag 840 gacgacttca tgtgtgacta ctgcgaggac acctttagcc agacagaaga gctggagaag 900 catgtgctta ccctccaccc gcagctctca gagaaggcgg acctccagtg tatccactgc 960 cccgaggtct ttgtcgacga gagcacgctg ctggcccaca tccaccaagc tcacgccaac 1020 cagaaacaca agtgccccat gtgccctgag cagttctcct ccgtggaggg tgtgtactgc 1080 cacctggaca gccaccggca gcctgattcc agcaatcaca gtgtcagccc cgaccccgtg 1140 ctgggcagtg tggcttccat gagcagtgct acacctgact cgacgcccga ccccgtgctg 1200 ggcagtgtgg cttccatgag cagtgctaca cctgactcga gcgcctctgt ggagcgcggg 1260 tccacgccag actccacctt gaagccgctg agggggcaga agaagatgcg ggatgacggg 1320 cagagctggt ccaaggttgt ctacagctgc ccctactgtt ctaagcggga ctttaccagc 1380 ctggctgtgc tagagattca tctgaagacc attcacgcgg acaaacctca gcagagtcac 1440 acgtgtcaga tttgcctgga ctccatgccc acgctctaca acctcaacga gcatgtgcgc 1500 aagctgcaca agagccacgc ttaccccgtc atgcaattcg gcaacatctc cgccttccac 1560 tgcaactact gccccgagat gttcgcggac atcaacagcc tgcaggagca catccgagtc 1620 tcgcactgtg gccccaatgc caaccccccc gacgggaaca atgctttctt ctgtaaccag 1680 tgttctatgg gctttctcac tgaatcctcc ctcacagagc acatccaaca ggcacactgc 1740 agtgtgggga gcaccaagct ggagtctccc gttatccagc ccacacagtc cttcatggag 1800 gtctactcct gcccttactg taccaactcc cctatctttg gctccatcct gaagctcact 1860 aagcacatca aagagaacca caagaacatc ccgttggcac acagcaagaa gtccaaggcg 1920 gagcagagtc cggtctcctc tgacgtcgag gtgtcttccc cgaaacgaca gcgcctctcg 1980 gggagtgcca actccatctc taacggcgag tacccctgca atcagtgcga cctcaagttc 2040 tccaacttcg agagcttcca gacccacttg aagctgcacc tggagctgct gctccggaag 2100 caggcctgcc cccagtgcaa agaggacttc gactctcagg agtccctcct gcagcatctg 2160 accgtgcact acatgaccac gtccacccac tacgtctgcg agagctgtga caagcagttc 2220 tcctcagtgg acgacctgca gaagcacctg ctggacatgc acacctttgt gctataccac 2280 tgcaccctgt gtcaggaggt cttcgactct aaggtgtcca tccaggtgca cctggccgtg 2340 aagcacagca acgagaagaa gatgtaccgt tgcaccgcct gcaactggga cttccgcaag 2400 gaggctgacc tgcaggtgca cgtcaagcac agtcacctcg gcaacccggc caaggcccac 2460 aagtgcatct tctgtggtga gaccttcagc accgaggtgg agctccagtg ccacatcacc 2520 acgcacagca agaagtacaa ttgcaggttc tgcagcaaag ccttccacgc cgtcctcctg 2580 ctggagaagc accttcggga gaagcattgt gtgtttgacc cagctgcaga gaatggcacg 2640 gccaacgggg tgccccccac ctccaccaag aaggcagagc ccgccgacct gcagggcatg 2700 ctgctcaaga accctgaggc accgaacagc cacgaggcca gtgaggacga tgtggatgca 2760 tcagagccca tgtacggctg tgacatctgt ggtgcagcct acaccatgga ggtgctgctg 2820 cagaaccacc gactccggga tcataacatc cggcccggag aggacgatgg ctcacgcaag 2880 aaggcagagt tcataaaggg cagccacaag tgtaacgtgt gctctcggac tttcttctcg 2940 gagaacgggc tccgggaaca cctgcagacg caccggggcc ctgccaagca ctacatgtgt 3000 cccatctgtg gcgagcgctt cccctcgctg ctgacgctca ctgagcacaa ggtgacccac 3060 agcaagagtc tggacacagg cacctgtcgc atctgcaaaa tgcccctgca gagtgaggag 3120 gagtttatcg agcactgcca gatgcacccc gacttgcgga actccctcac tggtttccgc 3180 tgtgtggtct gtatgcagac tgtcacctca accctggagc tcaagatcca tggcaccttt 3240 cacatgcaga agctggctgg cagctcagct gcttcctccc ccaatggcca ggggctgcag 3300 aagctctaca agtgcgccct gtgcctcaaa gagttccgta gcaagcagga cctggtcagg 3360 cttgacgtca atgggctgcc ctatggccta tgtgccggct gcatggcccg tagtgccaat 3420 ggacaggtgg gtggcctggc cccacccgaa cctgccgacc ggccctgcgc tggcctccgc 3480 tgccctgaat gtaacgtgaa gtttgagagt gctgaggacc tggagagcca catgcaggtg 3540 gaccaccgtg atcttacccc agagaccagt gggccccgga aaggtgccca gacgtcacca 3600 gtgccccgga agaagacgta ccagtgcatc aagtgccaga tgaccttcga gaacgagaga 3660 gagatccaga tccacgtcgc caaccacatg atcgaggaag gcatcaacca tgagtgtaag 3720 ctgtgcaacc agatgttcga ctccccagcc aagctccttt gtcacctcat cgaacacagc 3780 ttcgagggca tgggtggtac tttcaagtgc cccgtgtgct tcacagtctt cgtccaggcc 3840 aacaagctgc aacagcacat cttcgccgtg cacgggcagg aggacaaaat ctacgactgc 3900 tcgcagtgcc cacagaagtt tttcttccag acagagttgc agaaccacac gatgagccag 3960 cacgcacagt gagggacctc gcgacaggac acctctccgc agaaggcttg ccggagacgc 4020 cgtggggagg gccatttgaa cattacatcc aatcaaagtg tcatttgcaa cccagatgta 4080 aaactctaat gatttggcca tgaggcgctg ctattataag cagctggaaa tgaatattaa 4140 tggcagagat taaaagtatt ccatgctcag tgttttttat tgtcctgcta cagctagtgt 4200 gcttttagag ttttcgccgc agactacatt tctagtgtta gagaaacctg cttttttgag 4260 gctattgtcc tttgttcctt catgtattat attgatagtt ttttttaaag gattagtgtg 4320 attttttttg cttcttttct atttctttct ttcttgtttt tctttctccc ccttcagtta 4380 actacttttt aattgaaatt ctaggtaatt gtgcatcgtg atgtgattgc ttggctattg 4440 tctgaatatt tccttttaat tttttaatta aagactaatg ctttgattgg atttgccagt 4500 tcaccggaca gtgattaaaa ctctgtaatg aaaataatcg gtttccgtgc aactggatgg 4560 tctgctttta aatgtgactt gatctgactg cagtaactag ttcagctcaa taaagggaat 4620 ccatgcgttc acccccaaaa aaaaaaaaaa aaaaaaaaaa aaaaa 4665 72 508 DNA Rattus sp. 72 ctcttgtttc aaagtcttgt cgttcaggta atctggataa tcacctctcc tctgtcccaa 60 acactcccaa gagagtgatt gatgaactct tctttcctag cggtgtggtg tctcttggcg 120 gtgcacaccg acaggcacct gtggccacgg gcctagatcc caagatgcag gaatcggagc 180 ccaaccttga agggatgtcc tctcggagca tgaacttgtc ccaggcctac ttttgttgct 240 actacgagtc tgtgctttta aggcccgagg ggagctcact ggactccttc tctcaactct 300 gcgtctcttg ctgtttgttg ttctatgagg aaggggcacg ttccggtctc gatccggact 360 ttttagactt tctgtcaact agcctaactt cccccacatt ctttaaatcc aggtaacttc 420 ctaaaaccgc atagatcaca tcacgaatgg cctcagatca tgaattcatc tcaacagcat 480 cgtctcaaga acggtcctct tcctgttg 508 73 543 DNA Rattus sp. misc_feature (1)..(543) where n may be a or g or c or t/u, unknown, or other 73 acaaaacaac catgttagga aagacacaaa agtagaagta aatgtctaag taaaaactaa 60 ataaacatgc caagagctat gacttaaaaa tcctaatgat gcactcggct acatccagac 120 ccaagacaca aatttacttt gccacttaac aaaaaagaag cagaatgccc ttcatcttca 180 atgtattttt aaatgtttgc acattgtgtg tctgcaggca atggcagtat catcaagtat 240 ttaaacaaag tcacctttca ggttgaatga acaagtcaca tcgaaatgtg caatatattc 300 caggacagcc agggctgtta cacagacatg ctgtcaaaac aatacaaact ccccaaacac 360 ctcaatataa atgtttaaga aaacaccagt attatatagg caatataatt cattacaaaa 420 tacttttctt cataaaacta tctgctttag acacaaaagg ggtatgtggc ccttctgtgg 480 tgctaacatt accagcttac acagattcgt ctaacactgc tctangaaca gtgctgagaa 540 tgt 543 74 4562 DNA Rattus norvegicus 74 ggctccttgc tgagttctgc tccctctgct tccgaacccg ctcgggcgcc ggggcccgca 60 tccaaggtgc tgggccccgg acgtagtgcc gctgaggtcc ggagccccgg gaaccgggcg 120 gggacgttgt caggctggag ctgctcgggc cttctgactc agccgctcat ccagatttca 180 ggatttctct tctgggcggg gagcgcgtaa cttcctcatt cccgccggga ccctcgcttc 240 cagtcagtct ggcctgactt tgagccctgc tgtcaccagg gctcctccct tggacttaac 300 ctcgccccag ttcgaggcca tcccatcttg gctaggatga ctgtaaccca gacggacctc 360 tttccctatg gggactacct gaactccagc cagttgcaca tggagccaga tgaggttgac 420 actctgaagg aaggagagga tccagctgat cgaatgcatc cctttctggc catctatgac 480 cttcagcctc tgagagcaca ccccttggtg tttgctcctg gggtccctgt tatagcccag 540 gtggtaggca ccgaaagata caccagcgga tccaaggtgg gaacctgtac tctttattct 600 gttcgattga cccatggcga ctttacctgg acaaccaaga agaaattccg acactttcag 660 gagctgcatc gggacctcca gagacacaaa gtcttgatga gtctgctccc tctggctcgc 720 tttgctgcgg cccattctcc agcccgagag gcagccaatg agaatattcc ctccctaccc 780 cgaggaggtt ccgagggctc tgccagacac acagccagca agcagaagta cctggaaaat 840 tacctcaacc gcctcctgac catgtctttc tatcgaaatt accatgccat gacagaattt 900 ctggaagtca gtcaactttc ctttatccca gaccttggct ccaaaggact ggaaggggtg 960 atccggaagc gctcaggtgg gcatcgagtt cctggcttca cctgctgtgg tcgagaccaa 1020 gtttgttatc gatggtccaa gaggtggctg gtggtgaagg actccttcct gctgtacatg 1080 cgtccagaga ctggcgccat ctcatttgtt caactttttg accctggctt tgaggtccag 1140 gtggggaaaa ggagcacaga ggcacggtat ggggtgagga tcgacacctc ccacaggtcc 1200 ctgatcctca aatgcagcag ctaccggcag gcacggtggt ggggccagga gatcacggag 1260 ctggcacagg gtccgggcag agattttcta cagctacacc agcatgacag ctatgcccca 1320 ccccggcctg gcaccctggc ccggtggttt gtgaatgggg caggttactt tgctgctgtg 1380 gcagatgcca tcctccgagc tcgagaggag attttcatca cagactggtg gctgagtcct 1440 gagatttacc tgaagcgtcc agcccactca gatgactgga gactggacat tatgctcaag 1500 aggaaggcgg aagaaggtgt ccgtgtttcc atactgctgt ttaaggaagt ggaactggcc 1560 ttgggcatca acagtggcta tagcaagagg acactgatgc tactgcaccc caacataaag 1620 gtgatgcgac acccagacct tgtaacattg tgggctcatc atgagaagct cctggtggta 1680 gaccaagcgg tggcattctt gggtgggctg gaccttgcct atggccgctg ggatgatgtg 1740 cagtaccgac tgactgacct gggggacccc tccgaatctg cagattcaca gactcccacg 1800 ccaggttcag atcctgcagc cactccagac ctctcgcata atcacttctt ctggctggga 1860 aaagactaca gcaacctcat taccaaggac tgggtacagc tggaccggcc ttttgaagat 1920 ttcatcgaca gggagaccac acccagaatg ccatggaggg atgttggagt ggttgtacat 1980 ggagtagctg cccgggacct tgcccggcac ttcatccagc gctggaactt caccaagacc 2040 atcaaggcca gatacaagat acctcagtac ccctacctgc tgcctaagtc cgccagcact 2100 gcaaaccatc ttcccttcat aatcccaggc gcgcagtgcg ccactgtgca ggtcttgcgg 2160 tctgtggatc gatggtcagc agggactttg gagagctcca tcctcaatgc ctacctacat 2220 accatccgag agagccagca ctttctctac attgagaatc agttcttcat cagctgctca 2280 gatgggcgta cagttctgaa caaggtgggc gatgagattg tggacaggat cctaaaggct 2340 catgaacagg ggcagtgttt ccgagtctac gtgcttctgc ctttgctccc tggctttgag 2400 ggggacatct ccacaggggg tggcaactcc atccaggcca ttctgcactt cacctacagg 2460 accctgtgtc gtggggaata ttcaatctta catcgtctca aagcagccat ggggacagca 2520 tggcgggatt acatgtccat ctgtgggctt cgcacacatg gagagctggg cgggcacccg 2580 atctccgagc tcatctatat ccacagcaag ttgctcattg cagatgacag aacagtcatc 2640 atcggctctg caaacatcaa cgacaggagc ttgctgggga agcgcgacag tgagctagcc 2700 atactgatcg aggacacaga aatggagcca tccctcatgg atggggtgga gtaccaggca 2760 ggcagatttg ccttgagtct gcggaagcac tgtttcagtg tcattcttgg ggcaaatacc 2820 tggccagacc tggatctccg agaccctgtc tgtgatgact tcttccagct gtggcaggaa 2880 acagcggaga acaatgccac catctatgag cagatcttcc gctgcctgcc gtccaatgct 2940 actcgttccc tgcgggctct ccgggagtac gtggctgtgg agtccttggc tacagtcagc 3000 ccttctttgg ctcagtctga gcttgcccac atccggggcc acttagttca cttccccctc 3060 aagtttctgg aggatgagtc cttgttgcct ccactgggga gcaaagaggg gatgatacct 3120 ttagaagtgt ggacatagct ggggcttccg ctcagcggca gttgctagcc gttgggccct 3180 atcgtgcctg gatccttgcc ccacaccctg agttctgagg gcagtgccct ttgatccttg 3240 gggaggacat ctctgaggac tcctagagaa tcacagagga cctttacttg agaagtagcc 3300 aaagggagca ctcccaagcc tggcctggga aagcaggaga gagttctaga gaggtttgcc 3360 ttcttgtcac catgttcaga ccactatgcc acagaaccct agtcctacga ggaggcctga 3420 ggtcaagcct tttattccag gaaaagggac tcctgccctg ggtcgtcttc atctcactct 3480 tcctcctgcc cttgaacccc tacatgccct agggcctctc ccagcccgtt gctgcaaaga 3540 tggggggggg ggagtataga gccactttga ctgcagtccc caccagcggt ggtgaggaca 3600 ccttaactgc ctccaccagc ctgctgacag acactaactc tgtaccggtt caccaagcat 3660 ttcataaata aatgtgtaga aaaggccatg cttcttcttg gagatggatg ttgtcttagg 3720 tgctccctgc ctcagttcta acttctttct tctgcttgca tctctgtgct tgcttgcccc 3780 ccctttctcc ctccctccct ttccttcttc cttttccttc cttcctccct ccctttcctt 3840 cttccccttc cttccttcct tccttccttc cttccttcct tccttccttc cttccagggg 3900 gtggggtggg gttgatttcc aaatagagtt tctctgtgta gccctagctg tcctggaact 3960 ctgttaacaa ggctgccctc gaactcggag agatccacct gcctctgcct cctaagtgct 4020 tggacatagg cttgggccac agctccatct ctgtgcttct cagcagcttc tgagttcatt 4080 cacctaactc ctgaagatct gggctaattc tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 4140 tgtgtgtgtg tgtgtgtgcg tgatacatat cttcataact tttcatatgt ctgaagtatt 4200 ttgcaattag aaaactgctg aacacagata gccacagaag caggaagaaa ataaggagaa 4260 tggagaatta ggaaggcatc atagaaggca gtgggggtgg gtgagaatgt gaccagaagg 4320 tattaggttg ctctcccatc cctctagttc tcttgagaag tgaattctag ggttagtgag 4380 agatgcttca gcagaggaag atgcttcctg ttaggcttaa caacttgagt tcgatcccca 4440 ggacccacag tctaggagga gaaaacagtc tcctgaaagt tgtcctctga cctccacaca 4500 cagaccatgg cattcctgtg tgtgctcaca ataaataaat tcattttcaa aatcaaaaaa 4560 aa 4562 75 727 DNA Rattus norvegicus 75 agctgggact ggttcctgcc acctccccga gcccctgcac tcttgtctct ctctgtctcc 60 acagcagcgt caccggctcc cggatcctgc cagccgtcca gctctacagg caccatgagg 120 agcgccatgc tgttcgcggc tgtcctcgcc ctcagcttgg catggacctt cggggctgcg 180 tgtgaggaac ctcaggagca gggtgggagg ctcagtaagg actctgatct ctctctgcta 240 cctccacccc tgcttcggag actctacgac agccgctcca tctctctgga aggattgctg 300 aaagtgctga gcaaggctag cgtgggaccg aaggagacat cacttccaca gaaacgtgac 360 atgcacgact tctttgtggg acttatgggc aagaggaaca gccaaccaga cactcccgct 420 gatgtggttg aagagaacac ccccagcttt ggcgtcctca aataggccag cagtgcagaa 480 aagcactcca ctctcagacc ctggactgca tcataaagac agggttcctg tggcggtccc 540 agtgcctgcg ctcctgcttc cctgcctgca aggtcctcct gttggctccc ttccctactc 600 tgcacagatg ctgcatatga acagcctcta cccccatatc aattatggtt tctgtagtgt 660 cctgcattaa aaataccatg tctcctcctc aacaataaag ggtttttaca atggagtgac 720 tgaaaag 727 76 458 DNA Rattus norvegicus 76 ttggattccc aatactattt ttattgcaac tgcatgagag ttttttaaat tgggcattac 60 accatattac acaggattac gaacaagatt acagtacatg tccattccag tagtaccagc 120 atcacatcga aaacagcact tattctggac tgcattttac atgcaatagc tattgttcta 180 attatggatc aaaaggtttg aattttattt aaactactgt actaattgac tacacaggtt 240 taaaccccaa catcaacaac ctgaataaat ttaggctcaa atttattaca tgaatagatg 300 acaagccacc atttgtttga ctatgtataa aatcatgcat ttatattttt ttgaaacatc 360 aataacgtca gacactccat aaaaatcatt gacttcaaag aagtgaaaga ctaaaaagaa 420 agagtgcaaa ttgggggtga tagccgagca ccttttga 458 77 452 DNA Rattus sp. 77 aaaacatgat tacaagaaaa gaatacttca tttacgtgta acactgtctt catggacgta 60 ccgtggtcac ggagtagaaa tacaacatga gtatacagga aaggagagga gagaccggtg 120 tttggccctg gtggccccct cacagagaca cagaaagcag cagacacctc tgtgcattca 180 atgggacagt ggacacttgt gtttggagca gccagagagt cagggaggat gggcaaagcc 240 acctctggga ccctaatttc gtgtggtaac cccaaagaga tactggtctg cactggtgga 300 ggctggctcc tgtcaggttc ttcgcagcct tcagggacaa gctgaggctc ctgagtgccc 360 agtggtttgt cgcatagctt gctgggccca tgggctccat ggtgtgaaga gaaaaaaggc 420 caaagtgggc gactggctga tctctaggtg ac 452 78 2280 DNA Rattus norvegicus 78 cagcaagcca ccttttcact ctgtccctac tggaaagcac tgggaagagc tgcaccggat 60 agacatgaat gaggacaaag acaagagaga ctctatccag atgagtatga agggatgccg 120 gacgaataac gggtttgtcc aaaatgaaga catccaggag caggacccag actccaggga 180 cactccacag tccaacgctg ttagtatccc tgctccagag gagcctcaac taaaggtggt 240 gcggccctat gcagggatgc ccaaggaggt gctgttccag ttctctggcc aggctcgcta 300 ccgggtgcct cgggagatcc tcttctggct caccgtggtc tccgtgttcc tgctcatcgg 360 agccaccata gccatcatca tcatctctcc aaaatgcctt gactggtggc aggcaggtcc 420 catgtaccag atctacccga ggtcttttaa ggacagtgac aaggatggga atggagacct 480 gaaaggtatc caagagaagc tggactacat cactgcttta aatataaaga ccatttggat 540 cacttccttt tataaatcac ctttgaaaga ttttagatat gctgttgaag atttcaaaga 600 aattgaccct attttcggaa caatgaaaga ttttgagaat ttggttgctg cggtccatga 660 caaaggttta aaattaataa ttgacttcat accaaaccac accagtgaca aacatccttg 720 gttccaatcg agtaggacac ggagcgggaa atacactgat tattacatct ggcacaactg 780 tacccacgcc aacggtgtaa ccacccctcc caacaactgg ctgagcgtgt atggaaactc 840 cagctggcag tttgatgaag aacgaaagca atgttatttt caccagtttt tgaaagagca 900 gccggatctt aatttccgaa atcctgctgt tcaagaagaa ataaaggaaa taataaagtt 960 ctggctctca aagggtgttg atgggtttag tttcgatgca gttaaatttc ttctggaagc 1020 aaaggacctg agaaatgaaa tccaagtaaa tacatcccaa attccggaca cagtgacccg 1080 ctactcagaa ctgtaccacg acttcaccac tacccaggtg ggaatgcatg accttgtccg 1140 ggacttccgg cagaccatga accagttcag ccgggagcct ggcagatacc ggttcatggg 1200 gacggaggtg tcagctgaga gcaccgagag gaccatggtg tactatggcc tgtcatttat 1260 ccaggaagct gacttccctt tcaacaagta cttagccaca ctagacactc tttccgggca 1320 tactgtgtac gaagctatca catcctggat ggaaaacatg ccggaaggaa aatggcccaa 1380 ctggatgatt ggcggaccag agacttctcg gctgacttct cgagtaggga gcgagtatgt 1440 caatgccatg aacatgcttc tgttcacact cccaggaacc cccataactt actatgggga 1500 agaaataggg atgggagata tttccattac aaacctcaac gagcgctatg acactaacgc 1560 ccttctctcc aagtcaccga tgcagtggga caatagttca aatgcggggt ttactgaggc 1620 caaccacacc tggctcccca caaactctga ctaccacaca gtgaatgtgg atgtccaaaa 1680 aacccagccg agctcagcac tgaggctata tcaggatctg agtctactcc atgccagaga 1740 gctgcttctc agcagaggct ggttttgcct tttgagggac gacaatcact ctgttgtgta 1800 caccagggag ctggatggca tagataaagt cttccttgtg gttctgaatt ttggagaatc 1860 atcaactgtg ctaaatctac aggaaactat ttcagatgtt cctacaaaac tgagaataag 1920 attaagtacc aatccagcct ccaaaggcag tgatgttgat acccatgccg tttctctgga 1980 gaagggagag gggctcatct tggaacacag catgaagact ctcctccatc accagaaagc 2040 tttcagagac aaatgtttta tttccaaccg tgcatgctac tccagcgtgc tggaccttct 2100 gtatagctcg tgctaggcag ctctgtaaga ggtggccacc ctgcgtctgg tatgcttgcc 2160 atcacacatg caagggcctc aggaatggca tcagttctta gatatttctg tagcacgaat 2220 gcattgtttt aggtaagatt ctcaaatgtt tggaaggaca ataaaatgtt taaaagatta 2280 79 1673 DNA Rattus norvegicus 79 ggcttaagga aacttgctgc ggaggaaaga cattggctgg acacctctag atctacctta 60 agacttgaga gctaggttgg gaccatgggg acagcgaagg tgaccccatc tctggtgttc 120 gctgttactg ttgccacaat cggttctttc cagtttggct acaacaccgg agtcatcaat 180 gcgcctgaga caatcattaa ggactttctt aactacacgt tggaagagcg gttggaagac 240 ctaccaaggg aggggctgct gaccacgctc tggtcgttat gtgtggccat cttctccgtt 300 gggggcatga ttggctcttt ttctgtcgga ctctttgtca accgctttgg cagacgcaac 360 tccatgcttc tagtcaacct gattgccatc cttgggggtt gccttatggg cttcgccaag 420 atagcggagt cggttgaaat gctgatcctg ggccgcctga ttattggcat cttctgtggt 480 ctgtgcacgg gctttgtgcc gatgtacatt ggagaggtgt ctcccactgc cctacggggt 540 gcatttggca cgctaaacca gctgggcatc gttgttggga tccttgtggc tcaggtcttt 600 ggtttggact ttattctggg ctctgaggag ctgtggcctg gactactggg cttaaccatc 660 attccagcta tcctacagag cgcagccctt ccgttttgcc ccgagagtcc aaggttcttg 720 ctcattaaca gaaaggagga agaccaagca acggagatcc tccagcggtt gtggggcacc 780 ccggatgtga tccaggagat ccaggagatg aaagacgaga gtatcaggat gtcacaggag 840 aagcaggtca ctgtgctgga gctcttcaag tctcctagct acttccagcc ccttctcatc 900 tccgttgtcc tccagttgtc tcagcagttc tctgggatca atgctgtgtt ctattactcg 960 acgggaatct tccaggatgc gggtgtccag gagcccatct atgccacgat tggagcgggt 1020 gtggtcaaca ctatcttcac tgtagtctct ctgttcctgg tggagcgggc aggaaggaga 1080 accctgcaca tgataggcct gggaggcatg gctgtttgct ccgttttcat gacgatttct 1140 ctgttactga aggatgaata tgaagccatg agctttgtct gtattgtggc tatcttggtc 1200 tacgtggcct tctttgagat tggccctggc cccattccct ggtttattgt ggctgaactc 1260 ttcagccagg gcccccgacc agctgccatg gctgtggctg gctgttctaa ctggacctcc 1320 aactttttgg tgggaatgtt cttcccctcg gctgcggcct acttgggagc ctacgttttt 1380 atcatcttcg ctgccttcct cgttttcttc ctaatcttca cctccttcaa agtcccagag 1440 accaaaggca ggactttcga ggatattacc cgggccttcg aggggcaggc gcactctggg 1500 aaaggctctg ctggtgtgga gttgaacagc atgcagcctg tcaaggagac ccctggcaat 1560 gcctgagccg gacctcctcc ctcacctccc tccactgtgg aaagccaacc tcccctgaag 1620 tggcgagacc tcatcaggat gaaccaggac tgcttctgag cgctcgtatc aca 1673 80 2042 DNA Rattus norvegicus 80 gaacataaag tcagattgct aaacttctgt gtcgactgaa aaacatggtg aagcgagttg 60 caattgtggg agctggggtc agtggcctgg cctccatcaa gtgctgcctg gaagaaggac 120 tagaacccac ctgcttcgag agaagctgtg acttgggagg actttggaga ttcacggaac 180 atgttgaaga aggaagagcc agcctttaca actcagtggt ttctaacagc agcaaggaga 240 tgtcttgtta ctccgatttc ccttttccag aagactaccc aaactttgtg ccaaattctc 300 tgttcctgga atatctccag ctgtatgcaa cccagttcaa ccttctgaga tgcatctatt 360 tcaacaccaa agtgtgcagt ataacaaaac gcccagattt cgctgtctct ggacaatggg 420 aagtggtcac tgtctgtcaa gggaagcaaa gctcagacac ctttgctgct gtcatggtct 480 gcactgggtt tctaactaac ccacatctgc ccctggattc ctttccaggc atacaaactt 540 ttaaggggca gtacttccac agccggcagt ataaacatcc agacgtattt aaggacaagc 600 gagtccttgt ggttggaatg ggaaattctg gtacagacat tgccgtggag gccagtcact 660 tagcgaaaaa ggtgtttctc agcaccaccg gaggggcatg ggtgatcagc cgagtctttg 720 attcagggta cccctgggac atgatattca tgacgcgatt tcagaacatg ctcagaaatc 780 ttctcccaac tccagttgtg agttggttga tatcaaagaa gatgaacagc tggttcaacc 840 acgtgaatta cggtgtggct ccagaagaca ggactcagct gagagagcct gtgctgaatg 900 atgagctccc aggccgcatc atcactggga aagtgttgat caagcccagc atcaaggagg 960 tgaaagaaaa ctctgtcgtc tttaacaata caccgaagga ggagcctatt gacgtcatcg 1020 tctttgccac tggatactcc tttgcgttcc ccttcctcga tgaatcaata gtgaaagttg 1080 aggatggcca ggcatcactg tacaagtaca tcttcccggc acatctgcca aaaccaactc 1140 tggccgtgat tggcctcatc aaacccctgg gttccatgat acccacagga gagacacaag 1200 ctcgatgggt tgttcaggtc ctgaaaggtg cgactacatt accacccccg agtgtcatga 1260 tgaaagaagt caatgaacgg aagaagaaca agcatagcgg atttggcttg tgctactgca 1320 aggctttgca atccgattac ataacgtaca tagatgacct cctgacctcg atcaacgcaa 1380 aaccggacct gcgggccatg ctcctgactg acccacgcct ggctctgagc atcttcttcg 1440 gcccatgcac accttaccat ttccgcctga ctggtccagg aaagtgggaa ggagccagaa 1500 aggccatctt gacccagtgg gaccgaacag tgaacgtcac caaaactcga accgtacaag 1560 aaaccccatc tacctttgaa actttgctta aactctttag ttttctggct ttgcttgtgg 1620 ctgttttctt tattttcctg taagtgaaag atctaactgg ctttccaaat gtgtggagta 1680 taaccttcca acttctctaa tgtaacaatt tcaccttcgt aattgtaaac cacgtccaga 1740 gacacccaac ccctacctct ccccaactca cctcattggc accttcattg ctgggtctct 1800 tgctagtcca tcaggtttag tgcaagaaaa taatgtccag caattctgtt cacttaaaat 1860 gttggaagga tccaggcccc ctttcaggaa gaatctgccc ccagagagga ctctgagcat 1920 tctttcaatc taaaaaactg ctttccctag atcttaatga aaagcccaac ttcgcggaat 1980 attggtctgc actaaaatag ttctctgtgt attagttgac tacaaataaa atggaagaaa 2040 ct 2042 81 650 DNA Rattus sp. 81 gaatagcctg aggttctctc tctgtgagat gggtccatcg atcacatcca aggagaaacc 60 agaacccttg cccccaccag tgatgaggaa gtcactgagt ttggttggag ttggacctcg 120 gtcagctctg tcgtctttat agccctttaa caaaccgaag aagctttctc cggatgtccc 180 ctgcttgtcg attgaagatg tgctaccata ttccctgtgc aatgatggcc ccgacttgta 240 ggttggggag atacattcat caaaactatc cacatcaagc tcacttatgg tgatatcact 300 gttgctacgc tgacggattc ggcgaagggc tttcctgggg gagctggggt aggcttcagg 360 catgaggaat ctcgagtcca tgctctccgc tggccccgtc ttgttcttca gggtgttctg 420 gatgctcttc agcatggctg agtcattgga attgaggcta acagaactcc cctgactcac 480 aggactgcct ttggaggaca ggctctcaag gcaacttgag gtttctattt cctggcttga 540 acggctagat tcttttacat tttcctttct tggaggccaa tctgcaattc ttgccctaac 600 ccccatcttg gggactccgg gggttgaggt tatatggtga ggaccttcac 650 82 3300 DNA Rattus sp. 82 cggtcgcaca ccccccggtg tcccctcgcc tccctcgccg ccgccccctt cccccgctcg 60 cgataagaag agccggcggc aggagagggg atgaagatgg cggacgcgaa gcagaagcgg 120 aacgagcagc tgaagcgctg gatcggctcc gagacggacc tcgagcctcc cgtggtgaag 180 cgccagaaga ccaaggtgaa gttcgacgat ggcgccgtct tcctcgccgc ctgctccagc 240 ggcgacacgg acgaggtcct caagctgctg caccgcggcg ccgacatcaa ttacgccaat 300 gtggacggac tcaccgccct gcaccaggct tgcattgatg acaatgttga tatggtgaag 360 tttctggtag aaaatggagc aaatatcaat caacctgaca atgaaggctg gattccactc 420 catgcagccg cttcctgtgg atatctggat attgcagaat ttttgattgg tcaaggagca 480 catgtaggag ctgtcaacag tgaaggtgac acacctttag atattgcaga ggaggaagca 540 atggaagagc tacttcaaaa tgaggttaat cggcaaggtg ttgatataga agcagctcga 600 aaagaagagg aacgcataat gcttagagac gcgaggcagt ggttgaacag tggtcacatc 660 agtgacgtcc ggcatgcaaa gtccggaggc acagcactcc acgtggcagc ggccaaaggg 720 tatacagaag ttttaaaact tttaatacag gcaggctatg atgttaatat taaagattat 780 gatggctgga cacctcttca tgctgcagct cactggggta aagaagaagc atgtcggatt 840 ttagtggaca atctgtgtga tatggagacg gtcaacaaag tgggccaaac agcctttgat 900 gtagcagatg aagacatttt gggatatcta gaggagttgc aaaaaaaaca aaatctgctc 960 catagtgaaa agcgggataa gaaatctcca ctgattgaat caacagcaaa tatggaaaat 1020 aatcaaccac agaagacttt taaaaacaag gaaacgttga ttattgagcc agagaaaaat 1080 gcatctcgaa tcgagtctct ggagcaagaa aaggctgatg aggaggagga aggcaagaag 1140 gatgagtcca gctgctccag tgaggaggat gaggaggatg actccgagtc cgaagcggag 1200 acagataaga caaaacccat ggcttctgta actaatgctc acactgccag cactcaggca 1260 gctcctgccg ctgtgacaac acctactctg tcttccaacc aggggacccc tacatcacct 1320 gttaaaaagt ttcctacatc aactacaaaa atttctccca aagaagaaga aagaaaagat 1380 gaatctcctg catcctggag gttaggactt agaaagactg gcagttatgg tgccctggct 1440 gagatcactg catctaaaga agctcagaag gagaaagaca ctgcaggcgt gatacgttca 1500 gcttcgagtc ccagactctc gtcctctttg gataataaag aaaaggagaa agacaataaa 1560 ggaacaagac ttgcatatgt cgcccctaca atcccaaggc gactaggcag tacgtctgac 1620 attgaagaga aggaaaacag agagtcttca aatttgcgaa caagtagttc ttacacaaga 1680 agaaaatggg aagatgatct taaaaaaaat agttcaatca atgaaggatc tacttaccat 1740 agaagtacct caaatcgttt gtgggctgag gatagtactg agaaagagaa ggacagtgct 1800 cctaccgcag cgaccattct tgttgctcca actgttgtaa gtgctgcagc ttcttctacc 1860 acagccctga ccacaactac tgctggcact ctttcctcca catcagaggt cagggagaga 1920 cgcaggtcat acctcactcc tgttagggat gaagagtctg aatcccaaag gaaagcaaga 1980 tctagacaag caagacagtc tagacggtca acacaggggg tgacactgac tgacctccag 2040 gaagccgaaa agacaatagg aagaagccgt tctacgagaa ccagagaaca agaaaacgaa 2100 gaaaaagaca aagaagaaaa ggaaaagcag gataaagaga aacaagaaga aaagaaggag 2160 tcagaagtat ctagagaaga tgaatataag caaaagtatt ccagaacata cgatgagact 2220 tatgcacgtt acagaccagt gtcaacttca agttcaagca ctccgtcgtc ctcctcactt 2280 tctactctag gcagttcact ctatgcctca agtcagctca acaggccaaa cagccttgta 2340 ggtataacct ctgcctactc ccggggatta accaaagaca atgaaagaga gggagagaaa 2400 aaagaagagg aaaaagaagg ggaagataag tcacaaccta aatcaatcag agaacgacgg 2460 cgaccaagag aaaaacggag gtctactgga gtctccttct ggacacaaga tagtgatgaa 2520 aatgagcaag agcggcagtc agacaccgag gatggctcca gcaagaggga cactcagacg 2580 gattctgttt ccaggtatga cagcagttcc acgtcatcaa gcgatcggta tgactccttg 2640 ctgggtcgtt ctgcctcata cagttactta gaagaaagga aaccatatgg tagccgacta 2700 gaaaaggatg actcaactga cttcaaaaag ctttatgaac aaatcttagc tgaaaatgaa 2760 aaactaaagg cacagctaca tgacacaaat atggaactaa cggatctaaa gttgcagttg 2820 gaaaaagcta cccagagaca agaacgattt gctgacaggt cactattgga gatggaaaaa 2880 agggaacgaa gagctctaga aagaagaata tctgagatgg aagaggagct caaaatgtta 2940 ccagacttaa aagcagacaa ccagaggcta aaggatgaaa atggggcctt gatcagagtt 3000 ataagcaaac tttccaagta ggacagaaaa cacacaagcg aagcagcggg acttgcacac 3060 actccccagt ggaccacatt ggcagtcact ggacgccaga aagaacccct ggagactgtc 3120 attttccgat atcctgccaa acgccctctt atctaggagt tttgtttcgt ttaatcttct 3180 gccccacccc cttggttatc aagaccattg tttcatgtta aagccgctgc tgagaagatt 3240 ttttttcaat gactgagaaa acttgtttac agctccagca aataaagaaa gtgttcaagg 3300 83 1952 DNA Rattus sp. 83 gcggcacgag cgcccacgca gaaggcaagg tgtcccgagg ctccagggtt atgagatcgt 60 cactattcag aaccttttaa caacaggaag tggaaacatg accaaatcat acagcgagag 120 cgggctgatg ggcgagcctc agccccaagg tcccccaagc tggacagacg agtgcctcag 180 ttctcaggac gaggaacacg aggcagacaa gaaggaggat gagcttgaag ccatgaatgc 240 agaggaggac tctctgagaa acgggggaga ggaggaggat gaagatgagg atctggaaga 300 ggaggaagag gaggaggaag aggaggatga tcaaaagccc aagagacggg gccccaaaaa 360 gaaaaagatg accaaggcgc gcctagagcg ttttaaatta agacgcatga aggccaatgc 420 ccgcgagcgg aaccgcatgc acgggctgaa tgcggcgctg gacaacctga gaaaggtggt 480 accctgctac tctaagacac agaagctgtc taagatagag acactgcgct tggccaagaa 540 ctatatctgg gctctgtcag agatcctgcg ctcaggcaaa agcccagacc tcgtctcctt 600 tgtacagaca ctctgcaaag gtttgtccca gcccactacc aatttggtgg ctggctgctt 660 gcagctcaac ccccggactt tcttgcctga gcagaatccg gacatgcccc cacacctgcc 720 aaccgccagc gcttccttcc cggtgcatcc ctactcctac cagtcccctg gactgcccag 780 cccgccctac ggcaccatgg acagctccca tgtcttccac gtcaagccgc cgccacacgc 840 ctacagcgca gccctggagc ccttctttga aagcccccta actgattgca ccagcccttc 900 ctttgacgga cccctcagcc cgccgctcag catcaatggc aacttctctt tcaaacacga 960 accatccacc gagtttgaaa aaaattatgc ctttaccatg cactaccctg cagcgaccct 1020 ggcagggccc caaagccacg gatcaatctt ctcctcgggt gccgctgctc ctcgctgtga 1080 gatccccata gacaatatta tgtctttcga tagccattcg catcatgagc gagtcatgag 1140 tgcccagctt aatgccatct ttcacgatta gaggcacgtc agtttcacca ttcccgggaa 1200 acgaatccac tgtgcttaca gtgactgtcc tgtttacaga aggcagccct tttgataaca 1260 ttgctgcaaa gtgcaaatac tcgaagcttc aagggatata tgtatttatt gtcgttactg 1320 cctttggaag aaacagggga tcaaagttcc tgttcacctt atctattgtt ttctatagct 1380 cttctatttt aaaaaataat agtacagtaa agtaaaaaag gaaaatgtgt accacgaatt 1440 tcgtgtagct gtattcagat cgtattaatt atctgatcgg gataaaaaaa atcaagcaat 1500 aattaggatc tatgcaattt ttaaactagt aatgggccaa ttaaagtata tataaatata 1560 tatttttcaa ccagcatttt actacttgtt acctttccca tgccgaatta ttttgttgtg 1620 attttgtaca gaatttttaa tgacttttta taacgtggat ttcctatttt aaaaccatgc 1680 agcttcatca atttttatac atatcagaaa agtagaatta tatctaattt atacaaaaat 1740 aatttaacta atttaaacca gcagaaaagt gcttagaaag ttattgcgtt gccttagcac 1800 ttctttcttc tctaattgta aaaaaaaaaa aatagaaaag aaaagagaaa aacaacaaat 1860 tgcacaattt gagcaattca tctcacttta aagtctttcc ctctccctaa agtagaaacc 1920 agacccataa cactcaagag gaaaaaaaaa aa 1952 84 1017 DNA Rattus norvegicus 84 ttcaaagtcc taaaacgcgc ggccgtgggt tcggggttta ttgattgaat tccgctggcg 60 caggatcctc tgcagaaaga gagagcgcga gagatggaga tggacaaacg gatttattta 120 gagctgcgga acaggacgcc ctctgatgtg aaagagctgg tcctggataa ctgtcggtca 180 attgaaggca aaatcgaagg cctcacggat gagtttgaag aactggaatt cctaagtaca 240 atcaacgtag gcctcacctc catttccaac ttaccaaagt taaacaaact caagaagctt 300 gaattaagcg aaaacagaat ctcaggggac ctggaagtat tggcagagaa atgtccgaac 360 cttaagcatc taaatttaag tggcaacaaa ataaaagatc tcagcacaat agagccgctg 420 aagaagttag agaatctcaa gagcctagac ctgtttaact gtgaggtgac caacctgaat 480 gcctaccgag aaaacgtgtt caagctcctg ccccaggtca tgtacctcga tggctatgac 540 agagacaaca aggaggcccc tgactctgat gttgagggct acgtggagga tgacgacgag 600 gaagatgagg atgaggagga gtatgatgaa tacgcccagc tagtggaaga tgaagaggaa 660 gaggatgagg aggaagaagg ggaggaagag gacgtgagtg gagaggaaga ggaggatgaa 720 gaaggttaca acgacgggga agtggatgat gaggaagatg aagaagatgc tgctgaagaa 780 gaagggagtc agaagcgaaa acgagagccc gatgacgagg gccaagagga tgactaaggg 840 gaattaacct gtttggggaa attcctattg tgatttgact gtttttaccc atatcccctc 900 cccctcctat tcctgccccc cgaaacttat ttttttctga ttgtagcgtt gctgtgggaa 960 tgagaggggc aaagtgtact ggggattgcc gggggtgggg tgggggtggg aggggag 1017 85 614 DNA Rattus sp. 85 gtttgattgt ataatttaat gacaatataa acagtatagt ttgttttttt tctttttctt 60 tttttaaaaa aaaagactaa agcaaaaatg attaaaagct aacagaagct accgtgacat 120 tggtttgaca ttcaaagttt gagtcttagc aaaacggcca aaggtatctt gacttgatac 180 agagtatata atataaagac ttttagacct aaaaatcttc aaacgttatt tgaatttagc 240 aaaagcaaaa atttcatatc aaagtgctaa acagtgctgc cttaaaggtc actgcaaaca 300 aaggataaaa taactgcgtg ggaagtgaca tttattgtac aaatggttaa taaaaagaca 360 cattataaat atatatgtaa cctgctatgt ttttatatat atatatatgc ttatttaatt 420 tctaaccggt gtatccaagt caccacgaac accccatttg attattctgt aactcagcct 480 ccaggagctg gtggtcttgc aataaataca ggcaaagcga ttacaataga acgtgcatac 540 aaatgttcat acaaataagg acactatgca acgaatcatt tcataataat atgccatcaa 600 cagtagaaac atag 614 86 481 DNA Rattus norvegicus 86 ttccaaggca agtaaagttg attttaatat tgttctctta caggcagatt aaacaaacag 60 gcaaagtaca tacatagtac atggcagtct taagtgacct caaggtgtat tattaactca 120 gactgtgcta gcaaaaattc cagtctctta ggatgtaaga tatttttatc acaatgcatt 180 gccacagttc ctcctttttg ttttattaaa aaaaccatgt gggtggaggg ggtcactggt 240 gaagaactaa gaactgatca agcatggtac tgtagggata gcctgaatgc accacttcat 300 tctaggtgat ccttctgaaa agttggcaaa ggcataacag gacttgagaa ggaaaacaga 360 atgcattacc tgtagaagat cacacttaag atttattccc tgtactaact taaagaatga 420 attcatcttc tccaaccttc ccctcccaag aagattcact ggaatgactt acaggctgca 480 g 481 87 458 DNA Rattus sp. 87 ccggagctgg ggaccgaatt ttaggctttc ttgattgact ggctggcacg gctggagaat 60 ccgacttcac gaagagcaga gggctggcaa gaggagggca gttgagcagc agagcttcct 120 tagccaagtc agagggctgc cacgtctctg cttccaaagg tctcctgctg ctgggagcag 180 ggtgatggcc ccaggttagc ggaagtctgc gttgaaagct ctgccaatga ctagccgcct 240 cacctcactg gtcccaccta cgatctcgta cagtttggca tctcgaagaa agcggcccat 300 ggggaagtca ttgatgtagc catttccacc taaacattga atgccatcca gggctacttg 360 tgtggcacac tcggctgtat acagaatcac accagcacag tccttggcag tgatgtggcc 420 ctcgtcacag gccctggcga cattgtagac atactgtc 458 88 30000 DNA Rattus norvegicus 88 ggatcctcaa gggccccaga gaacttttat aaggaagaag accaaaatat cccaaccaca 60 gctttcacct agccctcctg agttcttttt ccagagctat tttagaggag tatctgactg 120 ggaagaaatt gggcttggta ccttgagctg gaaggccatg gagtcattct taaagagctt 180 atcccagctc tgggaggaca gagaacaccc caactgctct cagactatcc aagtgacctt 240 tagtgctctg aacaggacag gcaccccaca caggtggaat aacatactga aggactgggc 300 aggagccaga actccacttc tcaggaatgg ccagttgcaa gttctaagca aggtagccat 360 gagagaggtc cagggctggg gtctcctatt acttctcagt ccctcctccc agacaggtag 420 ggcctctcat cagatttctt gaacttaact gaaaagccac ataacatcta gatctttggg 480 gaatctcaac atagagtccc actctttagc actgagccca gactgggtgt cacctgtccg 540 gcaaatgaga gattagaggg ctagggtgat cctggtcacc ccaagggctg gctgacttgc 600 ctgccactca agccaaatcc atctgtgtct tcctgaaagc tccacccaga gccagggtga 660 cagacctctg aactagacat aggtcttctt gctagtactt ggtggttggg aagcctcctc 720 caacagtgtt aagaacccct tcctagtcac cccctctcca caggcccacc taaagaaatg 780 tcacaggtcc ctagtgagtc ctgtccctga caaaggaaga ctagaagtga gcatgaggtt 840 ggatggatag acattagacc caaaagaaga aagaaggagg ggcctgggac ctcctacagg 900 aacaccccag aggtaagtta gcacagcctg aggtgtggtg ccatgccggg atggagctac 960 gtgagctctg cgagcacaag cagaagcagg ggctggaaat tgggatggtt gaagtccgtt 1020 gagcctccag agaaagccac tggctgtcgc tgcctaggtc ttgggtgggt gagcagatag 1080 ggcacaccac tgctcaagtg ccacaaactc tctggagggc atatgagtca tgggtaaagg 1140 gaacaagaat ctttgggtcc ttgggccttg tggggcaagc tgaacttttc aggaaattca 1200 atggactcat gctatggagt tcttggggca cagaagtatg gtccagacgt ccagatctct 1260 atctatagcc tcactcctgc agtctttcag gcctcaaggg agaggaaggt tgctcccaag 1320 actcttaatc tgcccaagct gaagcacatc ccccatacct tcctgagacc tcctgcaact 1380 ctagtttagg gttctccttc ctactggtct tgaggagagg aagcaaaata ctagagatag 1440 gcttgttctg actcccaact cctccaggag tggacaaaaa gcagtcactc aaatggaatc 1500 agtgcccagg gagcagaggt cgaatcagac tcagtctttg atcagcatca ccctgagtgc 1560 taaccccaga catcccagcc agatgttcct ttaaggccca ggttactcca tggaggtcag 1620 ttctagacct ggctcagata ctcctgctgg cctcgctgag gtccttcagc acagccctgc 1680 agcccaggag ctccttcctg ccttggaacc aggggtcctc ttgtctatga ggcctgcagt 1740 gataatcgtg tttacccagc tagctctgtt gttgctttgg tatttgagct gggactgggt 1800 cgtttcagga cctgctccct aacaactccc agccttgctg gggcccctca tcgcctctgc 1860 ctaccatctg gagggccact gtattactaa ggatagccag tatacctgtg ctgtactctt 1920 gcctggattg atgaggcaat tcagatccac atatgtggga gagactcata ctaataccac 1980 acatggggag actgaggcct ggagccaggg cagccactct tctgttctgc tcagtactat 2040 tctccctcac agtataagtt ccccagacac ctaatatggt cttccaatgt agctgcctcc 2100 agtttggctt ccacagattg tattgtctct gggtacaagt gagctggcac ctcaggctag 2160 ccccagggac tgtatttgct ttctcgcatt atcaataaat gttcactgta cttgtgttct 2220 cccaggctga acaggccatg agtgatatga gaggtggggt caaggatgct gaggtccatt 2280 tttacccttt gagatcctta tacccagcaa ggttctgaac caaggagcag aaccaggtag 2340 ctctcaagtc aacctgccca gaaaagttgg cactaaagag tatcaagcac tatgtggctc 2400 tctagcctaa tgtaaatgag ggagacccct gaggagcctt tccacaaact cagatagggt 2460 gcccctcatg ctggtaacta ctgtttggga tttgtgaaat ctggacaacg tagccagaaa 2520 tggctagtcg tgaaagtgtc cagcagaggg caccaaaacc ttgcttttgc agtactccaa 2580 gtccgaaccg cctaccataa gtccttctag gggctgcata gtgtctccct ttgctcccct 2640 gaagtcacag gaacaaagag gcttcccaag aagctggctg ccctgaattc tggagagcat 2700 gaggtttggg taagaaagac tggatgtgca cctttggggt ttagtttctt agatacccat 2760 ggcggaggat gaacaaatgt ccgggtgtct gcattattcc tagaatgtcc cgggttagag 2820 tcatctccta cctgtctaga attacagccg cccagaggag agtgtaaggg gcaatccatg 2880 cagcacctca cactacggct aagcaaacag gtcaggggag ccgactgtct gtttccagct 2940 gggacctttg tggttcccag agccctagtg gacacagaga ccgggagcca gaggagccag 3000 taatccctct ctttgcttgg agttttctta ggcacatagg ctggaagaca aaactctagt 3060 tttcacatgg aataggggct acagggaccc cagggtgcag gatgcttcat gcagagtcca 3120 tctgggacgg ggtcagccct ttcccatgtc taccttccgc tcagtgctga gaagggaagg 3180 tcagttttgc aaagaatgct ggggggtggt gtggccaggg gtaggagagc tgaagaacag 3240 cctggatacc gtgtctcttt tctcttcccc catccaacta ttgtttctgg ctgggccaca 3300 gaacagcaag cgattccttc gaaagtgaag ctgtggggtg gccctgtgtg gtccagtcat 3360 gctgtgctgg caccaagtta cccagtggct ttgagccagc ccccttggaa tcactggctc 3420 actgcctggc tcccaggtag cttgttccac caccaccacc accaccttgg tacaaacacc 3480 tcacctataa caccttctgc tgcaggtagt cagaattctg tgtctgatct gcagacccct 3540 gaggcagacc cttgaatctg ggcctaggct gagctatagg tttaccagca tgctgctctc 3600 tccaggggcg gggggctggg gagtaagcca gggctcctca aactaaaagt tgtcgtggtg 3660 atgcatcacc cactaggtgc tccgtccata tactcaaact gactagattc agttatctga 3720 cctctgggct aaggccactg actcttggtg tgccagccaa gtcccaaagg ctatgtgtgg 3780 ctcccacctg tagactaatg atagattctt taccattttg ccaacaacag aagtatgtct 3840 taaagctgcc tggaggtcca cagaccctga tgtttcctag aggcctcctc ccctgcagtc 3900 ctgcgaagcc tttagaatcc tgctgctaac attccaccag gacctccctc tggctctccc 3960 agcccatctg tgtttgtgga caacagagtg gagtccctca gccctgccag cttgtggctg 4020 ttcatgctga ggccctggca ctcccccctc tgctcttcaa aggctaccct ggcttagaat 4080 tgagtcctgc tccagctggg tctccagtct tctgtccaat ccctaagacc cactgtgacc 4140 tggactgctg aggacccgga agactcctcc tagcttttat gggccaggtc ttgagcatgg 4200 gtcagcaaca cctggtgacc cagaagcatt cagaatgggc ctgaggacca gaggtaaact 4260 gaggcaccca aatctttgcc ctggcaaatg cttcctgact ttcccccagc ctgacatctg 4320 gggatgaaga acagcttcag ttggctaggg aacctgaaaa agaggacgtt cccaacctgg 4380 acccaaatat tgaaaaggag atgggagggg ctggttgggt gaggaaaaaa gctgacagag 4440 atctagcaag gaaggctccc tatatcccag ctctagctgc acaggtcaga tttaaatatg 4500 agtgggagct tcctgaagca atgttcatat tctatagttg cctcctcata cggccagaca 4560 ctgctgtgat tggactcatc ctgagttctt ctctaagacc tggcatatgg gctggccatg 4620 gcacaagcaa gaggtggcta tagctacaaa ttatacaaaa gatgtgtatt gacctcagcc 4680 ttcttgggtt gtgggtctaa agaatgaaag ctagtcagtg agtgtgaggc taatgtcttg 4740 gatatgagag tctctggctt tcatcttttg gagatagccc tggctctcat gtgggtattg 4800 tagagaagag aaaacgggta cacttcctgt atatgctccc aacatagcca tgagtataca 4860 agaggtgtca ttttctaagg gacaggatcc cccagagatg gaggagtgag gcatccactc 4920 atcagtggag aaaagttgaa tcagggagaa agggtttagt caagagtaaa ctcttgggag 4980 agaccagctg caccatgctg cagatgagga ttcctctggt cctgaggttc attgtgtatc 5040 ttgggacaag aaggtctttg ccattctttg aagtcctagc tcaatactca aggcatccct 5100 agccagagtt ccaccccaaa tcctagagtt ccctctagaa tgaagttttg tttgaacagg 5160 tagaagagcc tttatggaca ggtgccccac aactataggt tctggactgt tccaaatcca 5220 tttccacatg gttccagaaa taatctgtgt gatcagaagg aaaaatggag gttcaaggta 5280 aactgattgc ccagtgtaat gagcagaata ggtatttgaa cctaggcagt catcagcccc 5340 ctgtacaaac tctattccca tgtgatgccc agtgggtaga agtttataga atagacccct 5400 tatctccaag catgagtttt ccttggttct cagatgtgga gttgtagctt ttaagaaggt 5460 tgactcagcc ttgccaatca gccaagggga ggaaatcaat gtgcaaaggt taggaacaag 5520 ttatatgagg taggatgaga aggatttttg ttttttcctg tgttatgggt gaatcagaac 5580 ctcatattgg caagattttc taccactgaa ctacctcctc agccctaaca ccttcccctt 5640 ctctccattc ccaaggcttg ataaagcacc ttgtagttat gattgtgggg aaagaggcac 5700 gttctgaaga gtcaatgcaa tttattaaat gaccaccaga tgtcaagctc gagccgggtt 5760 tgcaaattcc tttcgggtgc aggatggtgg ggaagccaat gggtattaag aacagtgttt 5820 aatccactct tgttcatccc agggaccccc acctcttatc tagaatccaa gaattatacc 5880 tagaaagagt ccataataaa tttattttaa tataaatatg acaaagtttg cttagaacta 5940 gagttagaac ttagttcaag gtgtttgggg tacaggggtc ttaggacagg aagatatgca 6000 aagggaagac tgcctgataa gataggggag gggaagcctc agtctcaagt acaagactac 6060 agaactcact tataccatga tgctatagaa ggtctcctct ctgtaggaga aagaattaca 6120 gcatcgcctc ttaccacttt ccttccctga gtaatcccaa atccctgcag taagaacttt 6180 aaattgtgta ttgtaattaa agatatacct accagatgtc agcatgaggc cactttagca 6240 gtccaggaag gaaaactgga ttctgggaat ttctagaggt acttcagctt tgtttttcat 6300 aaatgagagt ccaggcagat tcccctgaat tagatgactc tcctctctcc aagtctcaag 6360 tgagtagatt tttttgtagt gtctttatag ttacatcttt tttttaaagt tacatttttt 6420 taaagattta ttcatttatt atatatatgt ccactgtaac tgtcttcaga tacacacgag 6480 gtacagacga ggacatcaga tctctttaca gatggttgtg agccaccatg tggttgctgg 6540 gaattgaact caggacctct ggaagagcag acggtgctct taaccactga gccatctctc 6600 cagccctata gttacatctt aatatttgtt catttgtttt ttgtttgttt gtttaatggt 6660 cccttcctcc ttcctcctgt gtcccctaga ctgtgtccat agatcatcca tgaaatataa 6720 ttttgaaggg tcatgccctt taaaatttaa ttcactgata ccttgtgccc cattcttgcc 6780 tcagtggcac catggtttag ttgctcctcc atgggcactc attgcctgca ttcttccttc 6840 cttaaacatt gaatattcta aataaaataa aataaaaata gctctggacc tattccaagg 6900 ggagttcaag gggacagagt tttgattcat ttaggaagag tagagttctt cttctctggc 6960 tgtgacctct gggatgtagc cagtttttcc ttatggcccc ttatccttct atctgctcag 7020 agggacctcc ttttgtggag cttccatgga atttcttcat gggtttagag aaagccttaa 7080 cttccctcat gccaacctct agctttatta aatcccagcc aaatgttctc agaaggcctt 7140 agtgatagca gcttgcaaca tccagtgatg ttgcaatgga caatagaggg cagcacttgt 7200 cacatacttc tagtcacagg gcccaaagta gatctagaat cttcacaaga aaaagactgc 7260 ctcactctga taagcgtgaa atgccttttg ctaagtatcc catatatgat ttggagtcat 7320 ccacaagcat agtgaaaaat cttacttgcc cttctcccat ggaataggat atctgtatca 7380 tgctgaacta cctccatcag atatgaattg ccataaagtt tatcttcccc tacagaaagt 7440 ctcattctcc ctaaacattc ctatctctac cactaagtaa gtgacttcag gtggaacttg 7500 gacaacaaat ggagagggaa tgattaagtc ttagcttgga ctgataagct gaataagata 7560 aaagattagc tggaatgcaa aagtgcaaaa ggtagagact tgtgggtttt tgtggggtga 7620 tttatttatt tatttactta tttatttatt tatttattta ttgagccttt gctgcaaaag 7680 tgcaaaaggt agagactcgt gggtttctgt ggagtgattt atttatttat ttatttattt 7740 atttatttat ttattgagcc ttgctgcaaa agtgcaaaag gtagagactc gtgggtttct 7800 gtggagtgat ttatttattt atttatttat ttatttattt atttatttat tgagcctttg 7860 ctgcaaaagt gcaaaaggta gagactcatg ggtttctgtg gagtgattta tttatttatt 7920 tatttattta tttatttatt tattgagcct ttgctgcaaa agtgcaaaag gtagagactt 7980 gtgggtttct gtggatttat ttatttattt atttatttat ttatttattg agcctttgct 8040 gcaaaagtgc aaaaggtaga gactcgtggg tttctgtgga gtgattaatt aattaattaa 8100 ttaatttatt tatttattga gcctttgctg caaaagtgca aaaggtagag actcgtgggt 8160 ttctgtggag tgatttatta attaattaat taattaatta attgagcctt ggctgcaaaa 8220 gtgcaaaagg tagagactcg tgggtttctg tggagtgatt aattaattaa ttaatttatt 8280 tatttattta ttgagccttt gctgcaaaag tgcaaaaggt agagactcgt gggtttctgt 8340 ggagtgattt attaattaat taattaatta attgagcctt tgctgcaaaa gtgcaaaagg 8400 tagagactcg tgggtttctg tggagtgatt tatttattta tttatttatt tatttattta 8460 tttattgagc ctttgctgca aaagtgcaaa aggtagagac tcgtgggttt ctgtggagtg 8520 atttatttat ttatttattt atttatttat tgagcctttg ctgcaaaagt acaaaaggta 8580 gagactcgtg ggtttctgtg gagtgattga ttgattgaac ctttgctcta ctctctgttc 8640 tccctcaaag ccaggaacaa aggtttcaga gtttaaatga gacttcaggc aatacttgta 8700 tctgagcccc atcccagctg ctgtctgtgc cttaaagggc gtttgagtct gtgccttaaa 8760 tggcgtttga gtcccgtgcc ttaaagggcg tttgagtccg tgccttaaac ggcactgcag 8820 ctaggctaaa acggggtgag aacccaccca atcaacatcc ccaaaagggt gaacaagtgt 8880 cactatacat gccatctgca ccagtatgct tagtgcttgg catcactcat aaatggtagg 8940 ggggcaggtc agatgactga tgacagagga ttctaaatcg agtatccatg aaacatgaga 9000 caacaagatg gcgctgtgcc acaaggatac accacaccca gaaggcctta gaacgaaatt 9060 ttaatattta ttttgaatcc cgatcctaat ttgcataagc cacgcccctt tttacctgtg 9120 ccacgcccac agacattcca gggtgtcaag tgactgtcag gtgtcaatct agtgaggccc 9180 caccccctcc ccacccctgc acatagtccc taccccctag ctaacaggaa gtgcttctag 9240 cttaattcaa agccacatag acgccttcct gtctatcagg caccaaagcc ccaccctcta 9300 atgcccccat accctagtgt gggaaagcgc catagtcagc tgccccagaa gtctttgcag 9360 tactttgctg gcatatcatt tcccaaattt ggagggggct ggaaatgggc gtggagggga 9420 cgaggtaagg gataaaacct cgtagtgtca tttgagcagg tgccttgctt ggtggtagag 9480 agcagaagcc acttctaggg gctctgttat catgcaagct ctaaacatcc ctctcaccgt 9540 ctcttcagac tcagcctcgt cccctcccag ccccgccgca acttcgtcgc cccggctgga 9600 gggtctgggc tccacaacca gagcaccccc tgctttggag gaggctgcta atattggccc 9660 agccagcgga tcatcgtcca ggcaatttcg gaagagaatc ttgggcacca gtgattcccc 9720 ggtcctcttt atccaccgtc cgggaacttc gggaactacg caacgactag agtacaggta 9780 actaactgat cttcctttgt tcatgccttt tgtgtttgaa tgaaagtgca gtttaaatat 9840 tctggtatgg gtaggtgggc tggggacttg ggagagttag gatccttcca ttagtctctg 9900 aaaaggggaa ggctgccgct aaactgcatg gtcccgctaa actgcacggt ctaagagtga 9960 cttaaacttc tgaggggaca gaaaggtcaa ctgtgactta caaggcttca caaagctaca 10020 aggaaagtac gttttccttc actaatctac catgcaaagg ggatgggcag ctctctttct 10080 cttcccagca ggagtacaca gctgttttgc aaatgtgaaa agttttttct gtgtgaaatt 10140 tttcccttgg tgcatgcctc acctcccaac ccccaccgtc ccgctcccgg ctacactgtg 10200 cggaggggac agagattctg gagcagtgtg tggtgtgata tatatataat atatatatat 10260 atataatctt aaaatgatat atatatatat aatcttaaaa gcaaagtttc ctactgtttt 10320 atatcctgta cctcctaact accaccagga tactggaggg tgggtaggac cctgaaagga 10380 tgatgtgcct tgctattccc tagcagggtt taaagtacaa aagctgcttt tctcctggct 10440 gtgcctatcc ttaagcaccc tccctccagg ttccctgtcc ataggcaggg atttggagca 10500 gcagcagtag cactctcagg tgaacattca tttgtgcaca aagcttttct tccctgcatc 10560 tggcattcca gactccttaa tccacccaga gtagtgttta tggggagaag ccctgccacc 10620 ccttccagag gttgccttcc aaaagcagca gagccatgtg tcctttggct tttgctccca 10680 tccaaggtgc aggtgtgagg aaaggaatgg gagacattca cagttttagc tgtgaccctt 10740 caaaaaaaaa aaaaaaagcc ttttgtgtgc agcagcagcc ttactcaagc ctcccccccg 10800 cccccaacag ggcaggagcc agtttgtcct tcccaaagga gggtctcctg cctctccctg 10860 aacagttgtc aaacttggca aagtgacttt tgaccaactg cttcctccct tctattgaga 10920 agggggtggg gagagtaagc agctgaagag accgggcagg ggggagtgct ccaaccatga 10980 caaggtgacc tttgggcttc cccatccagc agcaagggga ggggaggggg ccagactcag 11040 ctggagactt gtttttgagt tatctcaagg atttttgctt ctcctcctca ggggccgaat 11100 agttaccact gagctcactg tgcacaggga ggaggaggag gaggaagagg aagaggagga 11160 cgactacgac gactgcagac ccttaaccca gggtcctgca tctcctcggg ctaaacttgc 11220 ccagccagac ccgcagaagc ctaagtcaag gcctgtacca aggcctgagc caaggcctct 11280 gccaccgagg cctgagccga ggcctgtgcc acaggagcct gagccacagg agcctgagcc 11340 acaggagcct gagccaccaa agcctgagcc gaggcctgtg ccacaggagc ctgagccacc 11400 aaagcctgag ccgaggcctg tgccacagga gcctgagcca ccgaagcctg agccaaggcc 11460 tgtgccacag gagcctgagc caccaaagcc tgagccgagg cctgtaccac caaagcctga 11520 gccaaggcct gtgccaccga agcctgagcc aaggcctgtg ccaccgaagc ttgagccgag 11580 gcctgtgcca ccgaggcctg agcggaggcc tgtgccaccg aggcctgagc ggaggcctct 11640 gccactaagg cctgagctaa ggcctctgca gccaaggcct ctcccaccga ggcctgagct 11700 gaggcctctg cagccaaggc ctctgcagct aaggcctctg ccaccgaggc ctgaggcaag 11760 gcctctgcca ccaacgcctg aggcaaggcc tctgccacca acacctgagc ctgagcctga 11820 gcctgagctg aagcctgagg caaaaccaga gaagaaacag gctagagcat cccgagaatc 11880 cagccccgtg cccaagtgct gtgcctgtgg cccaggagat ccccacgctc ccagaactga 11940 atgaaacctc cgaaacctcc caccagcccc agtccttcct cagaggtgag tgcccttcag 12000 gcatacaaca gagcagaggt aactctagca cagacacatc agggctcctg caaatacagg 12060 atctagccaa gtgccctgta gccaaggaac cagccaaggc tgggaagctc agtcagtcac 12120 agtatctaga agtcaggtgg gctccaggtg cctctccttg ggaacaaggg gcgtaggcaa 12180 acagacacag gcgggcagca ccagtctctg gaaaatgtag gtccttgcta tgctcccagg 12240 agtctagggg caaaggtagc aaatgctgtg gggaggtcag acagcttcca gagggaaagg 12300 caaggttcca gatgaagaaa gaatctgagg aaagagaaaa tgaaaacaaa ctgttagcta 12360 aaaagaatcg ggaggccgag aaagctgggt ttgaggggcc ctgttccccc ttcccctggc 12420 tatgcagtaa cagctgtggc gcccctgcac cagtggcttg acctctctgt gtctatcccc 12480 atttgtcata gtgtgtgaca gtctcaagct tcctggctga actgtgaaca gcaaatgaca 12540 taatgaccaa agagacattc tgagacccgt gaagggcctt aggagagagc ttgtgtccta 12600 attctattgt gttctgaagg ccatgggttt cctgtatgtg aatggggaag ggaagcagct 12660 agaaagggcg tgggaccaga agcactgtgg actcgtccga ggaagctctg ctgtttggtg 12720 gccctgcagt tgactggtca ggtactgggt ggggatctgg gaggcagagg ctgcgtctgg 12780 tggaatacct ggggatttgg ggggcagcat gtaaggtctg tgtcctgtgc ctaagcctca 12840 aggggcccaa gcctcccagc acgtgaactc ccaccccgct tccagactat tatatgccca 12900 aaagaagcca agaatctcct tttaaacatt ggttcagaca gtaaatcaat aacatagtgg 12960 tatggcctca gtctatccct acctaggaca ccggccttcc ctcctgcccc ctctccagtg 13020 cccccacagc aactcttact tacttttggt gtgggtgtgt tgatcggttt gatggtgtga 13080 tcttcatgtt gctggttcag ttagtgactt cccaggcttg cagactgaga cagagcattg 13140 cttgggtctt cccggtgtgg agtccctatg tggtgtccaa gtccgcagcc cagttgagtg 13200 tgtcctctgc agcaggcatc cagcacacta gggacccttg gagagaagtt gcgcgtgtaa 13260 aatgcttgcc ttgttccccg aggagaggga gagcagttag ctctgtggct acaggaaaag 13320 tcacaggggg gacaaggaag acccactgcc tggagggaga gaggggcgag aggcagagac 13380 cctggctaag ggtgccagcc aggcttagtg ggtaagtctg aatcttaaag ggtatttagt 13440 ttcttcagaa tttgagcttt taagcccaga aaggagacct aggctgccag gatggtgggc 13500 agggcagggc aggggagggc agggaccggg gatcagcatg gggctaaagt ctagtgggcc 13560 cagccagggc aggggagggc agggaccggg gatcagcatg gggctaaagt ctagtgggcc 13620 cagccctgcc tagaagaaac aatagacctc atgacatgca aggtaggtcc ctttctacct 13680 tggggggggg caaaaataag ggaaaggaga gggccttaga taaagaatct tgtgtccgct 13740 gaacctggct ggggattagg ctgtgtgctt ttgagcccta ggttgacccg gaactgtacc 13800 agcctctttc cctgcccctt gagccacact ttgactaaat aaggtcaggt gaaggctctg 13860 tgggcagcca cacagaggaa gaagaacaca tgcataccct gtcccccacc ctaccccgcc 13920 tgcatgccct ggtggctctt cagtgggcac ctcaaggtga cctcagcatt ttcctacctg 13980 gcaaagtcca ggagtgcatc tccggggtgt ggttctgcag cttctccaaa atacctctgc 14040 agggagtcca ggagtgcatc tccggggtgt ggttctgcag ttctccaaaa atacctctgc 14100 agggcagaaa gcagagatta ggctctgaaa caggccaata gtgtgattgc tgccctctcg 14160 tatgcagaaa gcagagatta ggcttgaaaa caggccaata gtgtgattgc tgccctctcg 14220 tatgcccctt aggaaactgc tctggcctca caaggactcg ggggtgttca cagacacctt 14280 ttcaatgtcc cttactccca acaaattctc aagtctcccc ggaaagagga acatcatcgt 14340 accccagtct tcctagctgg aagcttcctc aaatccatag gcacacactc agagaggcca 14400 gaaaagccag ggatagaggc tggtggatgg gggaaggggg gcagcatggg tgtgtgtggg 14460 gggagacagg ccagatgttc ttggaatggg gacacggggg tgattgatgc ggacctgaat 14520 ttgaaagggg aacattcccc acgtgcttca tgctccgggt ggaaaatggg tggggtgggg 14580 gctctctcag tcctgccaag ataatatgga gatgcctcgc cctgtctagg tccccacacc 14640 tgtccactga ctttaacctg ccttcccaga gagtcagcct ttgagagtcc tccctccctg 14700 catacatcct ctacgggtat tatagcgaca agccctctta cagccatggc cccagggttt 14760 tcctcattct cctgctctcc agctgctccc ctatctaaca ccccacccca acccccaggt 14820 cctgctagaa tcatggccct tcacctccag cattgcccaa ctttgagggg gtggccttgt 14880 tcccagcaac agtacggtga gcctgtgtgt gatgtgtggg tttaatattt gcctttaagg 14940 agctggactt tccccagcca ggattgggag gctagtggca aaaatttttc tgagaaaacg 15000 aaatgagggt cctagcattc ggccacggcc accactgtgc cgttttgaga cgctccttcc 15060 ccgactttct cctctactcc ccaattttct accactttcc acactgctca aacaaaactg 15120 tttcctgacg ttgttagtcg ttagggagcg tacaaggccg gataggcgtg tgccgagcgc 15180 aggggcggtg gcaggggtaa ccactggggt gccccgggcc cccaaactga gccagggatg 15240 agtgctgccc ccggcccggc cccgactgtc gcctcgcggc gcctgaggct ggggcgaggg 15300 gcagtgggga acgagcgggg cgcggaggcc cgatgagcct ggggagcagc gcgctgctcc 15360 ccgacgttcc ccctccctct cagaagccgg tcccgctcat cctctgccac ccaaacccct 15420 ggttgtccag agaaggggaa cccccccccc aaaaaaaaaa agccaaaacc ggaaaataca 15480 gctggctcag gcgcgtgctc cgtggagtcg gtgctcccag tctgcgtcgc ctgtgcccct 15540 ttccccgccc gcgttccccc gtgagggcgc ccctgctccg tgagagttcc ttggcaccgc 15600 ggatccctgg cgccccgtcc ttcccagccc ggagctcagt ttgtcaaatt cagctctact 15660 tctctgtggt tcccctaaag ccggagaact tgctggcttc tctgcacctc aggcattagt 15720 ttcactggaa tccttgaaat ttaagcattg ctttccgggg gggcctccaa aatcattcag 15780 cattgttatg ttcacaggga aaggttttgt tattttgttt ggttggctgt tttgttttga 15840 tacttgattt ccaaatattt ttaaaggtgg gaattgcgcc tctgctgact cctcaaaatt 15900 aggttatttg gcttgggccc taagtttgct tctggctcct atccgtttta tatgcaagct 15960 cctatagaga taaaaacacc accctcagtt ccctaaaatt taaagtaagc cggatcctta 16020 gtcctttttg acagaatttc cttacttggt gatggacatt ttgacagcat ttagacactg 16080 tccccagggg atgaggtagt ggggtgtttt ccccgtatcc ctatcatttt atatctgcaa 16140 taatgtttct gctgactctt tccccaccac cggccctgta gtttttatta acgtggaaca 16200 taatacacta gattctgtgg tctattgccc caaccccgaa ttttacaagt tgaatcccac 16260 ccagacctct ttaaacaagt ttccaggaat ccacaaagtc ccaacctttt ttctatgtcc 16320 tcccctaaac gtaggcacct cagtagccta cactctcagt tctcggattc cagaaaaatc 16380 aagggagtgc attcaccagt cacacctaaa attcactcaa caagaccccc cctacccccc 16440 acaccccccc caaaaaaatc caggcataca cactggtcac tgggctttgg ggtttcctag 16500 agctcatgca agctttctgg ccaccgttag gttctttaaa atataaatgt tctcactaaa 16560 gcccccaaca cagactgagc aatgatctaa gcatatgtga tcctcatatg tggtggaggg 16620 ttttccatgc tctcccaagt gatatatgac tgtaccatgt ctctctaggt cacagaaggc 16680 acaaaattaa ctcttggcca tgctttttct ttctactaga atgctgaatg tccccaatcc 16740 ttgaacactg agctcttccc cacccccggc accaaatatt aagagtccct gggtcccttt 16800 aatttaacaa gttttttaaa aagtttttaa ctctaacacc aattgcaaaa ttcagtacca 16860 tcctgtatcc ttctcatccc cctgttggat catacttggt gtcctcaaac attaggcagt 16920 atttttccag gcccccaaaa cacacttagt gttagccttg ggtcacagag aggttctggt 16980 ttctcccact gtaataccag ctatacaccc actcagcccc ctacatttca atccccccaa 17040 cccctgtcat tccacacaat ctctgcagat tctcaggccc tacactttag gaactcataa 17100 tttctattcc ttgtaaggga gggattggct ctgaagcgct ccaaggagtg agaactttag 17160 attcagaaat tatttttctg gggcccccaa attcaagtac tccaaaacaa attgagattt 17220 tttttcttcc aaagagttgg gtttttatta aactttaatg aactttattg aatttatgac 17280 ttctcccgga atctcccaat atttgggtaa tgttttatgg tcccaaagca cactggaaaa 17340 aactccagat ccccaatcta ggtttaatat ctgtgtctca caattctagc tactatctgt 17400 cctgtaatta gattcacatc cattgggtaa tttttattga tccccaaaac agggagtatg 17460 tttctctaag aacaaacatt gtccaagaaa acataccccc cttgtttaac aaaaagatca 17520 ataggaatct gtttaagaaa ttaaaatatt aaaaggcatt gatttttctg tccttctcaa 17580 atggactgat tatactccta attccccaca tttagacagc atttcccact ctctctctag 17640 aggacttctc tgaagccaca gaaattagag gtgacttttc ccgggtgctg tagggcccac 17700 tggcttcagg gtgaaactta gcctccactc ccacccccaa aagaattgac ccttaattcc 17760 ctaaaaccca tctgttgttt ttttcactct cccattttta ggcaattaaa attagctccc 17820 cccccccaaa tttcatgttg ggaattttct taaactcaac agcaacagca aaccacctaa 17880 cttctataaa atagcctaag attttgtggg ttccccaaac ttgctgacta agaccttagg 17940 ttccccacgt taggcttgga tcaagatgtt cccctttttt gcatagaata attttcctca 18000 tttcccaact tggattctaa tttttctgga atctcaaatt tagtcgtttg ccacagtttt 18060 cccatttatg gagtctcatt caagtcctcc tactttgggc cactcattcc cagggccgca 18120 aatcagacaa gggtctgatt tgggatcccc gaaatttgat cattgtttcc gctggccact 18180 ttcggcagct tcccgtcctg cgcataacat cgatatcggc ctcttcactt ctcctacggt 18240 gtcccagcgg cagctcagat tttggaaaag tgtgtgtgcc ccaaacacgc accggacact 18300 gcgcggcggc caggacgtgg gcagtgctgc tcccgtgtcc aggaaaacca ctgggcattg 18360 cccccagttt cccccaaatt tgggcattgt ccccgggtct tccaacggac tgggcgttgc 18420 cccgggacac tggggactgc ctttggggtc tcgctcacct tcagcaacgt ccacttcagc 18480 agctcccact tctgcagctc tcccgccctt ctccgatcct cctgcgccac ggacccgacc 18540 ttcggccttg cggtgagctc ccggtggcgc gctcccttgt gctccagctc ttccgttgcg 18600 gggcaccggc tacctcgcag tttgtcttca gggctgctgc ggggtagcgc gggagcgggc 18660 tgctggcttc tcccttggcg accgcgctcc tttgggccgg gcgctccgaa acacgcgcca 18720 ctttgtatgc actctgcaaa caacgcggtg gtagggaaag cgcctgagcg gccaggaagc 18780 gggaaaggca gagcgctccg gcggcttgga tcctccagcc ctacaggtcg agggcttggg 18840 gaagcaagcg tagagggcgc gcggtgattg atggtcgggc cgagaggtta gccccgggtg 18900 caccggtttc cgccagtagt agccctttag aggcctagag atgcggaggt cgggagaggg 18960 aaacagggtg cacgggttta agagtgattt tgacttggta aaactctcaa tggcttctta 19020 tatctgaaac cacccgagat agcttcctaa caaccccaaa ttatcctgtt gcccctgggg 19080 ccgaacctgc aactaagcaa attcaaagtg ctaaggacag ggaactgtta tctggacctt 19140 ctgggtgggt cccaaaggcg cgccgctgag aagcaattta gtgagctctg gcccccaggt 19200 ctggtttgag ctacgattcc cgcccctgct tctttttctg cggaagggcg cagacattgc 19260 ccggggctct acgcagggcc cagatctgtc atgccaagcc agccttccct tcatggtaca 19320 cctggtggcc ccccaaaggc tgctaggaga tcccgcgcaa agcacgaggc taggggtgtt 19380 taggatctgg ctgctattcg aggtgccttg cagcccgtcc tgcttgctca gacaggtgga 19440 ggctgtgcgc tctgcaggag ccgatccagg ggactgtccc ctggttggca ggtaggtcct 19500 tgtaggacct cgatccagct acctgcccgc gccttaactc tgcctgctgg gatgcgccta 19560 ccttgtaatc ctctaactgg gcacaggtcc cctcactacc actgaggcct ggggtgccac 19620 gcagttggtt cgcaccacgc gcccatgttg cacaagcaca gggcaccccg gaaacatgac 19680 atgtggcctc gatttttttt gggatggaca cccaggattt agtgggcctg ctggggcgca 19740 cagctcccca tctttgctac caggcaccca ttcccccctc ccccaacggc aacccccgtg 19800 cccagggccc cggacccgca ttccccccgt ggctcgagtt gcgggggcgg tcccggggcg 19860 gggcaagggc cctgcggacg cccattggcg cgggcgtaag gccagcgggg cccgagcggg 19920 cgccgagccg cggggtggcg cggctataag aaccgggcgt tggcgcccgg agttcgcctg 19980 ctctccggcg gagctgcgtg aggccaggcc ggcccccggc ccccccttcc ggccgccccc 20040 gcctcctggc ccacgcccgc ccgcgctcgg cccgccagcg cctccatccg ggctggcggc 20100 cccgcgtcga cgccgtccgc cacctcgctg ctaactcccg tgcagggcgc cgtcggcggg 20160 gcctcgctcc gtcgggcctg cggatctccc caccgcctcc tcctctatct acctcaacac 20220 cccattcctg cttcgccaga ggaggcggtc cccaccgcag gcagtccggc ttgcaggtcg 20280 ccggcgttgt catcccccgc gctccccctc ccagccctcc ccggcgcgca gcccggcagc 20340 tcctctcttt tcgctgcagt cccgagcagc cgcggcgccg ccacgcctga cccccccaca 20400 agaagccggg gcttacgacg gctgagggct ccgtcggccc taaccgagct gggtgcccgt 20460 ggccggggtg acgcctccat tcctcccccc tcaacaccgt cctcgatcct tcgaagttgc 20520 atcctttcct ctgcttagag tgcgcccccc ctcgcgcact cgcttacccg ccacctttcc 20580 taggctcccc tcctgccccc tccccgttcc tcctcgcctc agactccctc cccctcacgt 20640 ccgccctctg ccttcgccta cccaagtgga ttaattatac gctttctgtt tctctccgtg 20700 ctgtcctctc ccgctgtgag cctacccgcc tctcgctgtc ctctctccct ctctccctct 20760 ctgtgtcccc cccctttcac gttcactctg tctctctcac tatctctgcc ccccaactat 20820 ccttgataca acagctgacc tcatttcccg atacctttcc cccccccgaa aatacagtat 20880 ctggcccgcc ccagccctaa gataccctaa agaagcagaa gagacgcccc cctccccatc 20940 aaaaaaagcc atctccccgt tctgtcccgt cgcacattcg gcctctgcga cttggacaga 21000 gcggcgctgg cagaggagtg cccggcagaa gggccttcgc ccgctgttcg gtttgcatac 21060 ccgcagcagg gagatgggcg gcagcgtcgc cggcttccag gtaagggctc tcaggggccc 21120 tggggtaggg ggtgcgcagg gcccggcctg ggctccgcgg gactggtgtg gtgggtgggg 21180 ggctgggggc tcgttcccta accaaactct tgttccccct tgcgccatgg aatggggaag 21240 gggagggggt atgggtccgt tggggagggg tgtgtaagtt gaactcacag gagactttct 21300 ggtgctttgc tgtgtattgg ggaaggggga aatgaactgc ttagaatgta tgatttgtat 21360 tgtgcagacc ccgtaccccc tccgcagggt gctgggtgat gaggaggggg gcagagtctc 21420 tgaagcccac cctggtatgt tgactttgtg ccaataggac aggaaagaaa actgggtggg 21480 cggggccaga agcaccccaa aattggcaga agtcaggcga gaccccacag ttaactgttt 21540 ctccccaccc acacacgccc caatctgtcc ccaacattcc cagccaggaa tgttgggggg 21600 gcggtttcgg gctcaaaagg gcagaaatgt tacagttttg agagtaactg cccctgcctt 21660 ttactgggtg gaactttcca taggatgatg tgggaaggac ccccctcccg cccccattgg 21720 tctgtgcaga aagggctggg ggtgcacgat gaggccccct cccactggtg gtgctttgct 21780 aaggaatggt ccaaggctag ctcttggggg tgcaggagaa aagggactgg ctggaaggag 21840 ggagggggcg ggtgcaaagg gggcgagggg agtggtcagc aaggaggggg ggtgggggta 21900 gggtggagcc gggactggga ggagccgact cagacataaa aagcggaggc actgaccagt 21960 tcgcaaactg gacatttgct tctcctgtga gaaccttcca gccttttcct gtcttcatcc 22020 tcttccagcc ccagcggcct ccttatccaa cttcaggtaa ccagggccat ggagccagga 22080 ccctgctgcc atcccccctc cggcctgcca tggggtgtca gggcaccggg tagcctgggg 22140 cctctgccat tgcagccggg cctgctacct cttccaagcc ttcgccctcc ctcggcctct 22200 gtcctgtgtt cccactagcc ccaggcttcc tcacctgtcc cgccttctac ttttccttcc 22260 cctttgcctc cccactcccc acaaattcag cccttccctg ggccttcacc tatcccctac 22320 ctcctggtca gtcccctgct tgcttctcct ggacacctgg ctggaccagc cagtgtgatc 22380 tggctttggg aagggacttc gaggtcacgt ccgcctgccc tgctccacgg caccccctca 22440 cccctaagga ttaactgctg tagctatata gcctgctgat cacaagtggt tggtggccca 22500 gagtgcattg ggtgggccgt tctgggtctt gttgaggagg tgggcaaaga taactttagt 22560 tatccgaggg agcgcgagca aatttgtttc ttgattattc cgcactggtc tctcacttgt 22620 ttctttgcag caagaaatcc cctgggcctg tggagcagag gctggggctg ggcctggggg 22680 ctctacttac tgcagggtca agagtggctt actgtggggc atcgtgacct cccaccacta 22740 aaaaaaagct ggctaaccat tgttctgtag agccacacag cctgtggagg gggtgtgtcc 22800 ctccaaaatg gggtgtctag cctcccagga actcaaactg cttcctctag taccctgggt 22860 aaatttagct cttatcagaa agacggtcct atgggccata gaggtactct ctagggtccc 22920 tgaccatgga tggtaagagt tgaggtctgg ataggacttt aggttccttt tgaagctctg 22980 aggtgcccag gaaaatctat acttggagga ctgggtgacc ctggagagca aggctcacag 23040 gagcgctggc aggcccattg cgctaaccat tggattcttg acctgggccc tgctcatcac 23100 gacctgggag gcggggtgga cgggtggggt tggctctgtt gcttagggaa gggtggccag 23160 ctgggggcgg gaggttggcg actggtccct ttggatgcac atgctccacg gtgggtgggg 23220 tggagctgtc tgccatcttg gccagtactt gggaggccag agtaggagtg gcagtggtga 23280 tgtgcccttg gactggcccc acccccatga tgtcatagca cggaaaccag gtccagtgtc 23340 cagcactggc ccctgctgct gtgggtgtgg ttaaactgca attgcgccca tggatgggtg 23400 gtcctcaagg gtttcataca gttcaaagcc accacagggt gcctttgtgg gctccattga 23460 gtttctgcac gcccagagcc aggttttttt ttttttttta cctcagcaat tcaaaccctc 23520 ctgtttttgc aggttaaata aataatgaga cggacgccgt ctttcaaatt ccaattacat 23580 ttttaattaa gagatgatag gtgtctttgg tggcggggag aagccagact cttggggata 23640 ggggtgtggc ttggggcctg tagctcaaaa cctgcgtttc tttcaccagg taccaatggg 23700 gatcccagtg gggaagtcga tgttggtgct tctcatctct ttggccttcg ccttgtgctg 23760 catcgctgct taccgcccca gcgagactct gtgcggaggg gagcttgttg acacgcttca 23820 gtttgtctgt tcggaccgcg gcttctactt cagtaagtaa ctccagagag acgggggagg 23880 cgggagcaag cggggtgggg gggaggtggt aacagcacag tggtctggct aagctaccac 23940 cccacccccc cttgggcaag gcggtaatct cacactcaca agtctgatag ctttaaaagt 24000 ttttttcaaa gttaataaaa gcaaaaccta atgtggttcc caggtcctag ccaggttgag 24060 agtgtacaca tagtctgggg ctccaaggag ggagagctgg accttggcct acaccatggg 24120 gtgcttacct gcttttcaat gttcatgctt ctcttgattt cccagttggg ggaggggctg 24180 gacctgggct cactgctcac agaaggcagt gatgggggtg gggtggggga cacacgggtg 24240 gggcatgcct gggaggggca ggggcaccag aatggatgac tgtccttgct ggcctgagcc 24300 actctatctt cctcacctgg tcctggacat gcagcctcct cctcttcact tctgcctacc 24360 tgcctgtgaa ctgctctgag tgctcaaacc tctggaaact acttctgctc ctgggtactg 24420 caggaccagt ccttgttcag ggagccaatc ctgcacggag gggcttcaca gataggaggg 24480 ccccaagccc agcctcggac cgtgggggag agggggaaga cgggagaaga gaagggagtg 24540 gtttttgggt gcctcactcc tcccctcccg tcttgttctc tcctgcccta tcttcccttc 24600 ctgtcacagt tcagcgatgg gggttgaggg tgggcccctc aggctcaagg tgacaccagg 24660 ctggggggcc ccaagtccag ggaccacacc tgtgtccacc atgtcccctc gaggggctca 24720 ccccgctccc tgtttttcaa accacttcag tgtggtctct ggtcttcttg gaatctggga 24780 cagagagact gtgctgtgtt agctgccagg caggaggcac cagatgccag gggctggtcc 24840 tgtagcacgc accttcgttt ttcctttctg ggcatcttgg cctgtctggc tcccactgcc 24900 accacatcct tgcagggtaa cctagggtcc agccagggcc tagtagaagt tcaggggaag 24960 ttcctttctt cttcagcctt cccaaggggg agggtttggg gaggccaccc cagtgggtgc 25020 tgaccccagt gaacactaac tgaagctgtc tgtcctgtgg aactttcagg caggccttca 25080 agccgtgcca accgtcgcag ccgtggcatc gtggaagagt gctgcttccg cagctgcgac 25140 ttggccctcc tggagacata ctgtgccacc cccgccaagt ccgagaggga cgtgtctacc 25200 tctcaggccg tacttccggt aggtaacagg gtggggtcga aggaattgtg gggtgagaca 25260 aagaaatcac ttgctccttg atttggggtg ccatgaagtc cacccacgtt ttactgttgg 25320 cctctgccca ttaacataca tgtttctata gactctagtg gggtgggtat attgacacct 25380 ggatgggagc tcaggctaac tcgataccct gaaacctgct gactagcacc tcctctccag 25440 gacgacttcc ccagataccc cgtgggcaag ttcttcaaat tcgacacctg gagacagtcc 25500 gcgggacgcc tgcgcagagg cctgcctgcc ctcctgcgtg cccgccgggg tcgcatgctt 25560 gccaaagagc tcgaagcgtt cagagaggcc aagcgccacc gtcccctgat cgtgttacca 25620 cccaaagacc ccgcccacgg gggagcctct tcggagatgt ccagcaacca tcagtgaacc 25680 aaattatgtg gtaattctgc aatgtagtac catcagtctg tgacctcctc ttgagcaggg 25740 acagctccat catgtcccac actaaggtct ctctgctcca cttcccttcc caggtttctc 25800 cccacccacc cccatgcccc gcctccccac atcaggctgc tccccttgcc ccacaccatc 25860 gggcaagggg atcccagcaa ctcttcaaaa ccaaatttga ttggctctaa acaacccaat 25920 tggcaccctc caaattatat atgaacatta aaaaaaaact ttaaagcata tagtcccttt 25980 acaacaaatt ggcttaagaa actccataac tgataatcta aaaattaaat aaccaaagaa 26040 attaattggc taaaaacata ctaaaaatta attggcttaa aaacaattgg caaaaatcaa 26100 ataatttggc ccgccccccc ccccttcatc ttctttccat ttagatcttt agtcaaattg 26160 gctcagactt ggatctcaga acccaagaag aaaggaaggg gacccaaaat tttgcaggta 26220 gcatgtcatt gcttcagtgc tctctccttg tcactagtca cttttagcat aatctggctg 26280 tgaacaacaa tagccgccca aactctttct tcactggtca ttccatcaca aatgtcaccc 26340 atgtcaccaa ggggctgggt gaaggaaccc aaggagagga acagaacatg aaaactgaaa 26400 atagaaccta attggcacaa gcccccagtc ccaaaaatct cacttttcat acctactcta 26460 aaaagcacat gattataccc acacgtacat gcacacacac atgcacacag gcatgcatac 26520 acacacacac acacacacac acactattag atgagaacat tgaaatggct gagcaacttc 26580 gattggaacc acattgccca atccaaggcc catcttaaat tccctgagca gtttgcatgg 26640 tttgagctcc tctctgaatc catctagttt ctgctgccag tgtagagtca gtttggccag 26700 ataaggagat ggcactgcca agtgatacat gctacccgag tagcctgacc cctaggtgtg 26760 ctcctgggag gaaagatctg ggggacaacc cctaccccaa gcacacctat gggccatctc 26820 tgtcaatctc ctggggagcc cccacttttt aggggctccc caggagactc acactgatgt 26880 ggggagtgtg ggaagtctgg cggttggagg ggtgggtggg gggcagtggg ggctgggtgg 26940 ggggaaacta tgggtaggaa gtggtcccag agaggtctta ggtggaacag tcaggaggag 27000 gcacaggtca acttgcagaa ttactgaaga atcaggaccc caaattttat gtcaattgat 27060 ctattcccct ctttttatgt ctggggcagg ttttttcctt tttttttttt aatccctcct 27120 tagcttttaa tgcgctcata atcccattcc ctatgtaacg ggggcagcga tcaagtaatg 27180 aatgcatcaa gccatcaata ccagcgagag ccagtaacac cggctagagc catcaacacc 27240 ggcttccacc atgtcctgct cccaaccatt tatcaacctt tttttttttt ttatctgtct 27300 ctatcgcttg gcctgagttg ggagtggagt ctctgtgggg tgctggccac gcacccacag 27360 agaaataaaa ggaattgaga aggccgctac ctggcctgac ttctggggac agtggctggt 27420 ccccagaagt tctgaggagt ggagggggcg tggggcagtg tcccctcagg tgttaggaag 27480 gtgctcggag gccacaaaga tggggcccca gctggccctg ccagttgggg gggaagggga 27540 tgtagatgta agactagaga ggttccatca ggcgggagca agtggctgcc ttctgagcac 27600 ttgggggagg tcctccccgt gcccctcagt gtcatcttgc ccactcctca gcaccccatc 27660 ttaccctcag gaggtctgga gctctacaga cctcctgggg gcaaggtggg gtgaggcctg 27720 gagctgggga agcgaggagg ctttaaagcc ttcagagcca ggagaactgt gtacatgggg 27780 ttgtctgggc cctggggccc gagggtctgg tgagccgtag cagccactcc acggtgccta 27840 ggactgcggc ggggaacagg gcggctggag gtttacctca cccccacttc tgcttccagt 27900 gcagtccccc tgcccaacag tcctactagt aatctagagg cctgaggctt ctgggcccag 27960 gtgacaggac tggcaccacc ctgggggcgg tgtgtgtcag ccagccatgg cacagagggt 28020 tctcagcaag tgcctaaaga atgggccatt tggaacattg gacagaaact caaagagtaa 28080 attgttataa ttggagaata tgaattggcc tggtacccaa aatatctcga ggcaccctaa 28140 attacctgcc catttgactg gacatccacc cagtgttaat atgcctcgtg ggatgggtgt 28200 tttcaggggc atttgctgac catcctctgt gtccccagat ttgcagttct ccccatcata 28260 ggtcaccctg atgcaggcac ctccctggcc tcccatgcct agtgtggccc tccatcttgt 28320 tttgtctctt ccctactgtc ttcggtggga tcccctcttg ggtcccccaa tttgtcatcc 28380 tgtgaagact tcccacgcgt cgaatgccat atgtcacctg tgccactgcc catgtcatcc 28440 agcagtggcc ccgggtattt gccccaactc agtcctttta acatgcattt tctggcaaaa 28500 tccaaagctt gggttttgtt tttaacctgt taacgcttgc aaacctaata aagcattcaa 28560 aatactattt attgagttct ttgctctttc acttggggga gggggcaaag gaaaggaagg 28620 gtggcaggga gggaaggagg gagggaggga aagagtaaag ggtgcctgat gcctggggga 28680 acatggggag cagcagatga aaagcctttg aagcatgaga agaagtgggg ttcaataagg 28740 cactctgagg aactcgagag cagaggacca catttgccct caaagatggg gctaaaggga 28800 tcccatgatg ttttcgaatg agctgtgtac catgaatacc taccccaggg aacattgccc 28860 ttagccttgg gggttgggga gcaggatagg tgaagagtcc tggttggtga tggacatggg 28920 gactcagggt gaaaaacacc taggaatccc ctgcgcacac acagtcgccc ccattgttcc 28980 ctctatcaaa gctatgctgg agcctgtccc cagtagttac tagatagacc tgtccagtgg 29040 taatggtctt tgggattcct gaccctggga gtaaagtgaa gccaacccct agatgcttga 29100 aacctggcct ggatatcttc cttctgcacc tttatagtcc ttagtacccc cactttccta 29160 gtgcagatgg gcatgtaccc ttcccccatg tgggaggagc ccaggctggg gcagctgctg 29220 aggccctgtg cctgggaccc tgctcctttt ccccagttaa tgctacacac tagcatttct 29280 ttttattgtg tttagctact ttttaaaata aagagtctca cactctaccc caggcaagct 29340 tgaactgcaa atggatgtat gtccatgctg gcctggaact caacagcagt ctcagtcctg 29400 cctcagccac ctgggtgttg aaattacagg tgcaagccaa cacaggggtt agcattagct 29460 ttcagttttg tttgcactcc agggtctctg ctgagtgtgt taaagtgggt cttcagggca 29520 gccttcaaat ccaagggctg cttttaaact cagggtacac agtggtcatg ggggactgct 29580 tagaagacga agttcatcct tgaagaggaa gactggatag cagtaaggtg aaaatgaagg 29640 gcactggcca gcctctggct tgacagagtt gaggttaaga caaggcctgg atctgggcag 29700 accaccttcc cacccagtga gtctttctta tcagatgctt cagtgtcttc tatgtgggtg 29760 gaagaaggcc cagagctgcc cactcctttg aggacaattc agtgtgttct catctgtccc 29820 atcatctcct gagagcccac tataggtctg tcccaccccc ctgtcagctc ctcagttgac 29880 taggacccta aggagcaggc cttgggatat ccaggttcca gactcttacc ccttttccca 29940 agctggttcc ttcctgactt gctaaccaac atcacctgaa catggagtcc ttggcacctc 30000 89 806 DNA Rattus norvegicus 89 gcttgaatct aatacgtcga tcataccatg ttgaagatga gcgggtggca gcgacagagc 60 caaaataaca gccggaacct gaggagagag aaaccggtcc aattacagtc atggcagctg 120 agtctctgcc tttcgccttt gagacagtgt ccagctggga gctggaagcc tggtatgagg 180 atctgcagga ggtcctgtcc tcagatgaaa ttgggggcac ctatatctca tccccaggaa 240 acgaagagga agaatcaaaa accttcacta ctcttgaccc tgcatcccta gcttggctga 300 ctgaggagcc agggccagca gaggtcacaa gcacctccca aagccctcgc tctccagatt 360 ccagtcagag ttctatggct caggaagaag aagaggaaga tcaaggaaga actaggaaac 420 ggaaacagag tggtcagtgc gcagcccggg ctgggaaaca gcgactgaag gagaaggagc 480 aggagaatga gaggaaagtg gcacagcttg ctgaagagaa cgagcggctc aacgaggaaa 540 tcgagcgcct gaccagggag gtagagacca cacggcgggc tctgatcgac cgcatggtca 600 gtctgcacca agcatgaact gttggcatca cctcctgtct gtctctcccg gagtgtaccc 660 agcaccatca cgccagtgcc aagcatgtaa tctccagtgc acatgctgag gaggggactg 720 agggtagacc aaaggagagg ggcttgtaca ctgtacattc tttattcatt ccatacccag 780 taaagtgact ttgtgtgaaa aaaaaa 806 90 437 DNA Rattus norvegicus 90 tttttttttt tttctttggt ttttaattct tttttttttt attattatta ttggtttgtg 60 tgagagaggt tgagaaggtt tggtttacac tgagtatatg ttgtcaagtg gccaaagtcc 120 acatagctct cctgttttct gtatacgttc acagcctcaa aaaaaaaaaa aaaaataatt 180 gaaatggctt taaaaaccga acagaacacc tccatcctgc gataagtacc tcgaatggat 240 tcagctttac ccctctgtga ctcatcctca cattctcagc atatttaaca gaccaacaaa 300 accaagtact agtagtgagg ggctgaccca cgtggctttg cagtgctctt cgtccagaag 360 catggcacac gatgcttgtg catgtggaaa cttagcgact gtcaacatac attctcaggg 420 atttatccct cgtgccg 437 91 2340 DNA Rattus norvegicus 91 ggatccagat gagagattct ggtacggagg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 60 tgtgtgtgtg tgtgtgtgta ttgcacaaga atgaaaactg aaaaacaagc agtatataaa 120 tggctcccgg agattctgag atgctgaggc ttgcttgtat gttgctatag tgtatgttgg 180 tgcttgggag ccactgtcat ggataggtat gttgctgggt catccaagcc agtgtgtgga 240 cactcaggta caggaagcaa agtgaaggca tcagcaggca tttttgtttt acgatgttta 300 aattacactt attttatttg tgtgtacgag tgtatgggtg gggatggggc aaatgccaag 360 gggcacttct tgtgagagtc aatctgttcc ttctagcatg tgggctctgg agatcaaact 420 caggtcattg agcttggtgg caagcacctc tacctactga gccacctgtt caacacccac 480 ctgtaggcat ttgtgttcat agtagttcat agccctatga acatatagca cctaggccaa 540 gagagcctgg cttccccacc ccctcccctt gtaccccaac ctctgccact tcatctcact 600 cctactaggc agctgggttt tttccctcac tgtaggcccc tgggcaggca gccagcagcc 660 gcgcccaacg ctgggaggga gaagaatggg tcagaggctg gagcttgtgg ttgagttggg 720 gagtgagtaa gctgagtgag ggatggaaaa ctgctgttgt tgaggccagg cctggggggg 780 aggcacagaa ggctgctggc atgaatttct agagtttgag tggtaagttt tgcaagtttc 840 agagcttgaa gcacatatga gcttcttgcc atcagtgggt accactcctc tgatctccct 900 gggagtgagg tcggtctctg gaagtgctct tagagagtag gttggagtag agcactaaaa 960 acggggacag actgagtgtg acttgagtga tgcctagcaa catatatcca gctctcaaca 1020 cactgttggt gtgggttgga gaaggctact tttgtgtctc ctgcccctag gtctcaacgg 1080 tcaccatggc gagacccttg gaggaggccc tggatgtaat agtgtccacc ttccacaaat 1140 actcaggcaa cgagggtgac aagttcaagc tgaacaagac agagctcaag gagctactga 1200 ccagggagct gcctagcttc ctgggggtga gtggatcctg tctgtgtatt gcatatgtga 1260 tgcatcccca ggaggaggct ggggctggag atatctatct atctatctat ctatctatct 1320 atctatctat ctatctatct atctatctat ctatctatat ctatatcatc tatctatatc 1380 tatctatcta tctacatata tatatatata tatatatata tatatatata tatatatata 1440 tatatctctc cctactcctg gcgcttggta tggaaccaca atgaaccatc tacttcacac 1500 cagccccccg ttgagacaag gcttagaatg aagttaactg aagtggcaca ggaaaaccac 1560 attaggtagt cagtgtctga aagcacagcc tagatcagga cagtctttcc ggtgatgtgc 1620 aacagaaatc gagtttctgc ttgtgaagac atgattgtgg aggcacacaa atgcctgcag 1680 atcttcccct caatgacacc ttatcttagt taacacctcc ttgtcatgac agttacctat 1740 agacatagtt aaaacaagcg tggggaagat gtggtcacat cctttcccag ctagcccatg 1800 tgctcatctc acagttgagc cctgaggcta gcacggtgtc tgcaagcctt cctgagctcc 1860 tggctggagg tggcgtctaa ctgtacctct tctacctcca gagaaggaca gacgaagctg 1920 cattccagaa gctgatgaac aacttggaca gcaacaggga caatgaagtt gacttccagg 1980 agtactgtgt cttcctgtcc tgcattgcca tgatgtgcaa tgaattcttt gagggctgcc 2040 cagataagga gccccggaag aagtgaagac tcctcagatg aagtgttggg ccagtggggg 2100 aatcttccat gttggctgtg agcatagtgc cttactctgg cttcttcata catgtgcaca 2160 gtgctgagca agtttaataa agagttttga aactatgtct gagagactgg agattgtggg 2220 tgggtgggtg ggctgaggga gggtggtggc agagatggct ggaagttgac ctggagcttt 2280 gtggggccaa ctagaaaagg ttggggaggg gtgagtgact atctgagtca tacagtgtat 2340 92 423 DNA Rattus norvegicus 92 tttttttttt tttttttgac atagtaatta ttttttaatt caaaagctgc aggtgggagt 60 tttacaaaat caagataagc ttagtaaaaa aagttttcta aaattaaaag aaacaaacaa 120 aagaaggtga tttttctcag taccttagta taacttaact gttgctaaaa ctttgatatg 180 tctgtttcca gtctcctttt tccgcatatt attttgattt tctgttattt catgtaattt 240 tatcatatat gacatacatg tatacacata catacttaat atctgggcct atgtaagtgg 300 taactagttt gcatgtggtg aggaaaatct ggtttaagga ctgagtagac aaggggttaa 360 aatttaagct ttaacgtctg ttagcgctta tcacctaaaa ccactttgta cacatgaata 420 cac 423 93 1030 DNA Rattus sp. 93 gggcggaagg ccgcactgtt gccacaatga tttcctgttc ttcctttttc aggttgtcga 60 ccatgtgtca agctgtgtag ggaaactgcc tggagatgga cattctgact gctaatgggg 120 acatgggacg gcagttacct ctgatcactt gcgtggattc tcctggagca gaaagacaga 180 agccgccagt cactgctccg gatctagcac tgggattctg tgctgaaggg tctaaaaact 240 cgttttgatt tggatctggg agaatgtctg agcaaggaga actaaccccg accatactgg 300 aagaaggccg gacagagcca gaatctgccc cagaaaatgg catcctcaag tcagaaagtc 360 tggatgagga ggagaagctg gaactgcagc ggcgactggc agctcagaac caagagagaa 420 gaaaatccaa gtcaggagca ggtaaaggga agctgaccag aagtcttgct gtctgtggaa 480 gagtcttcag ccagacctgg aggggaaagt caccaggatc agactctctg agaactgcaa 540 acggaaagga actcaaaaga acttagatta aaacttaact tacaaatgat aaaataaaat 600 aaaataataa aatccagtaa gcacggaagt gctcgatgtc cactgaggct ctctcttgat 660 aaggctgaac caaatacacc ccaagcatcc tgtctctatg gagacagctt acctctcagc 720 tccccaaatg aacctgccta taaataacac accactgttg gctcctagga gatgagaaat 780 agtgtctaag caccctgaac tgtggagaaa tgctgctatg gccttcggga ggttgttggt 840 ttttggtttg gtttgttttt tttttttttt ttttttttaa ctactgcaaa accggcccag 900 aagagcgtct gaactggtag cgaatttgat attaaccagc tgctgcacct ccactctgtg 960 atcccccaaa gcttaaagtt ccagggccct tcgcatcctc ttcgttaccc tttttttttt 1020 ccttttttaa 1030 94 1616 DNA Rattus norvegicus 94 atggcgctga gtgacctcct ggaactcact ctgctgctgc tgctgcccct gctggagcgg 60 ctctcggccg aggactgccc gtgctccgag gcttctctct gccgaccgat ccgccaccac 120 cgcgacttcg aggtctttgt gtttgatgtt ggacagaaaa cctggaaatc ttatgactgg 180 tcgcagatta cgactgtggc agtttttgga aaatatgact cagaacttat gtgctatgct 240 cattcgaaag gagccagggt agtgctaaaa ggtgatgtgg ccctgaagga catcattaat 300 cctaccttca gagcgtcatg gatagcgcaa aaagttgcct tagccaaagc acaacacatg 360 gatggaatta atatagacat agaacaggaa gttgattgct cgtcacctga atatgaagca 420 ttgacagctt tggtcagaga aaccacagag ggtttccatc gtgagattga gggatcacag 480 gtaacttttg atgtagcttg gtcaccaaag ggcatagaca aaaggtgcta taattatact 540 gggattgcgg atgcctgcga ctttcttttt gtgatgtctt acgatgagca aagtcagatc 600 tggtcagaat gtattgcagc cgccaatgct ccctacaatc agacattaac tggatatggt 660 gactacctca ggatgggcat cagccccagg aaacttgtaa tgggtattcc ctggtatggc 720 tatgattaca tttgcctaaa cctctcaaag gatgatgtct gcgccattgc aaaggtccct 780 ttcagagggg ctccttgcag tgatgctgca gggcatcagg tgccctacag ggtcatcatg 840 aagcaagtaa acagttctgt ttctggaagc cagtggaacc aagaccagca agctccgtac 900 tataattaca aggatcccac tggccgtttg catcaggtgt ggtacgataa cccccggagc 960 atctcactaa aggcagcatt cgtcaaacac tacggcttgc ggggcattgg catgtggaac 1020 gcgaactgtc tggactactc agatgacgct ctagccagag agcaaacgga ggaaatgtgg 1080 ggagccctga gaccacggct gtgatacata cagaggaatg tcttttgtca gacggttaga 1140 tttgcaggac tggcctgtgt aagcagatct acctcttcaa gagataaaaa tatatatttt 1200 gtcatgttac cctaacttac ttattggtct actcaataca caggaaagaa ataaagaaac 1260 tattcatttg tacttggaaa atacatacat atcttaaata tccaggaatg taaaagcaaa 1320 aacaagtcaa cttactctat taaatattcc tctgtttcag tattataatt atattatatt 1380 ataattagaa aagttgcttt tagaattcca tgctgaattt ttcctcctgt tattaaaata 1440 tactgttaaa aaccaatagg gatttttaaa aatgtataac tagtagacaa gtgaagatac 1500 tggagaaggg agtgcacatg accatagtgt gttacacaca tgtatagact atcatagtaa 1560 agcccattat ttttataatt aatatatgct aataaaagat gtgtaacttg aaatgg 1616 95 574 DNA Rattus norvegicus 95 gacagttcgt gagagagaag ctttattaaa actagtggca gatctggact gctactgttg 60 cgcctttggt gcatgttgag cctgggcaag gatgggtaca tagggagata caggtgggat 120 gaggagcaaa gagggagggc gctgcctact gagactgtgc tcaactgttc tcccagaaga 180 agttgttaca agccactgtg agagcagcca ccagcacaac aaactcctgg aagtccactt 240 ccccatctcc attctcatcc agttccttca tgatcttgtc cacagcatct gcatccttct 300 ggacatccag gaagctggag agttcagttt gtagcaggtc tttcagctcc ttcttgctca 360 gcttatattt gtccccttcc ttgcccgagt gggcatggaa cacattgatg agggtctcca 420 tggcggtctc cagctcagag cccatttcag cagcacacgg ttggcctcgg cctgacgtcg 480 gagcaggttc ttgacagaag ggcgcgctgc aaatgttgcc ccggagcagt gtggggagac 540 ctgtctttac tgccgaccct cctccccttg tgcc 574 96 1312 DNA Rattus norvegicus 96 gagtgccagt catgtcgtcc atcctgccct tcaccccccc gatcgtgaag cgcctgctgg 60 gctggaagaa gggcgagcag aacgggcagg aggagaagtg gtgcgagaag gcggtcaaga 120 gcttggtgaa gaagctcaag aagacggggc agttggacga gctggagaag gccatcacca 180 cgcagaacgt gaacaccaag tgcattacca tccccaggtc actggatggc cggctgcagg 240 tatcccatcg gaaggggctc ccccatgtca tctactgccg cttgtggcgg tggcccgacc 300 tgcacagcca ccatgagtta cgggccatgg agctctgtga gttcgccttc aacatgaaga 360 aggatgaagt gtgtgtaaat ccttaccact atcagagagt agagacacca gtgctacctc 420 cagtgttggt gccacgccac accgagatcc cggccgagtt cccccctctg gatgactaca 480 gccattccat ccccgagaac actaacttcc ccgctggcat tgaaccccag agcaatattc 540 cagaaacccc acctcctggc tacctgagtg aagatggaga aactagcgac caccagatga 600 accacagcat ggacgcaggc tctccaaacc tctccccgaa tccgatgtcc ccagcacaca 660 ataacttgga cctacagcca gtcacctact gtgagccggc cttttggtgc tccatctcct 720 actacgagct gaaccagcga gttggggaga cattccacgc ttcacagccg tccatgacag 780 tagatggctt cactgatccc tccaattcag agcgcttctg tctgggcctg ctgtccaatg 840 ttaaccggaa tgcagccgtg gagcttacaa ggcggcacat tgggaggggt gtgcggctct 900 actacattgg aggggaggtc tttgcagagt gcctcagtga cagtgctatt ttcgtccagt 960 ctcccaactg caaccagcgc tatggctggc accctgccac tgtctgcaag atacccccag 1020 gctgcaacct gaagatcttc aacaaccagg aatttgctgc cctcctagct cagtctgtca 1080 accagggctt tgaggctgtc taccagttga ctcgcatgtg caccatccgc atgagcttcg 1140 tcaaaggctg gggagccgag tacaggagac agacagtgac cagcaccccc tgctggatcg 1200 agctacacct gaatgggccc ttgcagtggc ttgacaaggt cctcacccag atgggttccc 1260 ccagcatccg ctgttccagc gtgtcttaga gacactggga gtaaagggat cg 1312 97 1200 DNA Rattus norvegicus 97 catgagcgcc gctcttttca gcctagacag cccagcacgc ggcgcaccct ggcccacaga 60 gcccgcggcc ttctacgagc caggcagggt gggcaagcca ggacgagggc cggagcctgg 120 ggatctgggg gagccgggct ccacgacccc tgccatgtat gacgacgaga gcgccatcga 180 cttcagcgcc tacattgatt ccatggctgc cgtgcccacc ctagagttgt gccacgacga 240 gatcttcgcc gacctcttca acagcaatca caaagcggcc ggcgcgggca gcctggagct 300 gctgcagggc ggccctacgc gacccccggg tgtggggtca atcgccaggg gcccgctgaa 360 gcgcgaaccc gactggggcg acggcgacgc gccgggctcc ctgctgccgg cgcaagtggc 420 agtgtgcgcg cagacagtgg tgagcttggc ggccgcggca cagcccacac cacccacttc 480 gcccgagcct cctcgaggca gccctggacc gagccttgcg cctggccccg tccgagagaa 540 gggcgccggc aagaggggtc cggaccgggg cagccctgag taccggcagc gacgcgagcg 600 caacaacatc gctgtgcgca agagccggga caaggccaag cgccgcaacc aggagatgca 660 gcagaagctg gtggagctgt cggccgagaa cgagaagctg catcagcgtg tggagcagct 720 cacccgggac ctggccagcc tccggcagtt cttcaaagag ctgcccagcc cgcctttcct 780 gccgcccacc ggcaccgact gccggtaacg cgcggtgtgg gccttagaga ctccgaacga 840 ccgatacctc agaccccgac ggcggggagc agacgccgcc cgaattgcta cagtttcttg 900 ggcactggac tgcgagagaa gctatatgaa tcccccttaa attatttttt tataatggta 960 gcgttttcta cgtcttatta ccattgcagc taaggtacat tgtagaaaag acttttccga 1020 cagacttttg tagataagag gaagagactg cgcatgcttt ttatattcat ttttacagta 1080 tttgtaagaa taaagaagca tttaaattgc aaaaaaaaag aggcaccagc tctgactggc 1140 ctctttctag gctacggtga tcctgagcat cttttgttac ctgctggtag aaatgatcct 1200 98 593 DNA Rattus sp. 98 ccgggctcat taggaaacct ttattgtcac cattagagat aaaaacatcg gagtcccaca 60 cgccacacca cacaaggagg ctaactgaag ggttactgat tgtcacagca gcccagacaa 120 agggctgaac ctcggtcctt agccaccccc tcccctgcac gatcagtggc agagggcaat 180 ggtctccatg ttgaggaagc ggatgtgcat cttggtctcg atgtcgatgc cctgccagat 240 cttcaggaag tcctcaaagg tgatcccctc gtacacctgg tccggttcca tctgtcccac 300 gcacacgctg gcagcctcca tcatggctcc gtcggcgatg gaccgagctg actccttctc 360 gatgtgagga tttccagaga gcagttcctc caccacattt cgatactctt ccagggtgat 420 gcggccgtca ctgtccgagt catacatatg gaacagaaat ttcagcttct ccttccgaga 480 cagctccact tgttcctcac ccagggtagt gtcaatgggc cggaagtagg acatgatagt 540 caggaagtcc tcgaagttga tctcatcggc caggccgctg gaccccttac gca 593 99 281 DNA Rattus sp. 99 acgccggacg ccgcccacgt ctgaacttcg ggagacggag ggccccgggg agcggggccc 60 tgcgagcaga ctcccagccg cgcgacgcga tcccccccgc ggcggcgaaa ccgcaagggg 120 gggcggcgat tgatcgtcaa gcgacgctca gacaggcgta gccccgggag gaacccgggg 180 ccgcaagtgc gttcgaagtg tcgatgatca atgtgtcctg caattcacat taattctcgc 240 agctagctgc gttcttcatc gacgcacgag cccctcgtgc c 281 100 573 DNA Rattus sp. 100 ttcatcacca atgctaagta aggttgaatg gagtggtgga gtggggaatt tactgatacc 60 ctaacatggt ttgtacaccg tgagtcattg catcagagag ctcttggtct ccagaagaac 120 gtggtatcct ggttgttgcc ttactagctg atgaatcaga tttactaagt tgatcgaagt 180 tttgccgctg tgcttccccg tgaatcccct ttagcagata tggctgtgag ctgttttctt 240 gggttggtat ctttgagcat aggtgagttg ggttccttat ctcttgtcaa gactaaaatg 300 ggggaatgaa aatgcagaca gtttaatgat tatttttagg agctggcatg atatgggctt 360 tggagatgaa atagaaacaa cacaagattc aggtctctga agtcaataaa aatggaattt 420 tggaggtgca ggttttaaaa tatatttctt ttcatgagaa gcacagtatg ctgcgttggt 480 gcactgttca agtgacatgg agatccttgg ttgaagcagc cattattcta actctctaac 540 gacttgttct gaaataaaac cgtggtttgt atg 573 101 528 DNA Rattus norvegicus 101 tttttttttt tttttttact ggtactaaag gttttactgt aataaaccaa atatatttac 60 aatttataca aatgttgaaa caacaaaaat acaaaaagaa aaaaaaatca caagatgcaa 120 aaccaagaaa caagttcact ggagacacca ctgctctaca aagactcaag gtgaattgaa 180 ctgcgagaaa cagaacggta agctcgcagt cagcaaaggc aggacggcag acagcagctt 240 gctcctctat tccacacctg tggcctcggt gtgattgtgg tgtccaaagt tagacaggtg 300 agtgtctact gatagacttg gtgccttctt tttgtagacg caaagcagtt ctgtgtcaag 360 atttggaaac aggtttcttc agtttttttg atgttgtttt tgtttttttt aaattataag 420 ttctatttct acaaattcca aacatttggc ccaattcaac tgtaaatatg aagccatggt 480 attctcagta gatgacctca taaacggtag cctttttgtc ccctgagc 528 102 3509 DNA Rattus norvegicus 102 cgcccgcctg agtggccgga cctcgcgcct cgcgcctcgc gtcccgcgct gcagccgcag 60 tcggcttttg ttgtctccgc ctcctcgtcg gtccccgact ctggaccacg agcggcgcgc 120 gctgggacct tggctctgcc cttcgcgcag actgggctga gcgggccggg cggccatcca 180 gggaggcgcg cacgggcggt cggggagccg cgggccccgc catggagctc cgggcccgag 240 gctggtggct gctgtgcgcg gccgccgcgc tagtcgcctg cacccgcggg gaccccgcca 300 gcaagagccg gagctgcagc gaagtccgcc agatctacgg ggctaagggc tttagcctga 360 gcgacgtgcc ccaggcagag atctcgggag agcacctgcg gatctgcccc cagggctaca 420 cctgctgcac cagtgagatg gaggagaacc tggccaacca cagccggatg gagctggaga 480 ccgcactcca cgacagcagc cgtgccctgc aggctacact ggccacccag ctgcatggta 540 tcgatgacca cttccagcgc ctgctgaatg actcggagcg tacactgcag gatgcttttc 600 ccggggcctt tggggacctg tacacgcaga acactcgggc cttccgggac ctgtatgctg 660 agctgcgtct ctactaccga ggggccaacc tacaccttga ggagacactg gccgagttct 720 gggcacggct gctggagcgt ctcttcaagc agctgcaccc ccagctgctg ctgcccgatg 780 actatctgga ctgcctgggc aagcaggcgg aggcactgcg gccgtttggg gatgcccctc 840 gagaactgcg cctgagggcc acccgtgctt ttgtggcggc acgatccttt gtgcagggcc 900 tgggtgtggc cagtgacgta gtccgaaagg tggcccaggt tcctctggcc ccagaatgtt 960 ctcgggctgt catgaagttg gtctactgtg cccattgccg gggagtccct ggtgcccggc 1020 cctgtcccga ctattgccga aatgtgctca aaggctgcct tgccaaccag gccgacctgg 1080 atgccgagtg gaggaacctc ctggactcca tggtgctcat cactgacaag ttctggggcc 1140 cgtcgggtgc ggagtatgtc attggcagtg tgcatatgtg gctggcggag gccatcaacg 1200 ccctccagga caacaaggac acactcacag ctaaggtcat ccagggctgc ggaaacccca 1260 aggtcaatcc ccatggctct gggcctgagg agaagcgtcg ccgtgccaaa ctggcactgc 1320 aggagaagtc ctccacaggt actctggaaa agctggtctc tgaggccaag gcccagctcc 1380 gagacattca ggactactgg atcagcctcc cagggacact gtgtagtgag aagatggcca 1440 tgagtcctgc cagcgatgac cgctgctgga atgggatttc caagggccgg tacctacctg 1500 aggtgatggg tgatgggctg gccaaccaga tcaacaaccc tgaagtggag gtggacatca 1560 ccaagccgga tatgaccatc cggcagcaga tcatgcagct caagatcatg accaaccgtt 1620 tacgtggcgc ctacggtgga aatgatgtgg acttccagga tgccagtgat gacggtagtg 1680 gttccggcag cggtggcgga tgcccagatg acgcctgtgg tcggagggtc agcaagaaga 1740 gctccagctc ccggaccccc ttgatccatg ccctccccgg cttgtcagaa caggagggac 1800 agaagacctc ggccgccact cgcccagagc ctcactactt ctttctgctc ttcctgttca 1860 ccttggtcct tgctgcagcc aggcccaggt ggcggtaact gccccctagc ccaaaggact 1920 gtcttggcca aaacatgcaa cagaccatat ttacttccct tggccttcga ggcccagggc 1980 aggataagga gacagtagct ctgagtgctg gggcagggcg catggggtcc tggccttcct 2040 gggtctggcc acgcctgtca ccctagcttc tagttgttgt atcaggtcag ctgcgagcca 2100 gtgtccccaa aagccatgtt tttcagggac ctcaggggca cctctggctg cacactcctc 2160 ccctaccctc ctgtaccacc ccagaagctc acgaggtcac ccagaagggc ggttattagc 2220 tacaacccat cggagacctc aagtgagtct gtgtcttcct ctcctgcctc ttcccggtgg 2280 ggactcccca ccagacccca tgggacacag atgtcagaaa ttgaggccca tcccgcagct 2340 ccccaggaag cctggaaggg atgccagtat gtcgctgacc aggctctggc agggcctaca 2400 agtttatgca tgataccttc tcctcagaag aagctctgca ggcaggatcc caacacacac 2460 cagaccagga ccctgtgcca ctgtggagct cagtgaatct ggttctcaaa gaagacccac 2520 tgtggggttc ctctagtgtg acataggtga ggtggcagct ggcaaggccg cactaggctg 2580 ccacactgtt tgccccctca gatgggctca taaagactgg gcttgagggt ccacatgaag 2640 agcctcactt caggggaata gccggccacg tctagcccct atcccactta gaaaggtcac 2700 ctggccacgg tgatgctggg tggctgatga aacttaagct cagggtcagt gggacctggc 2760 actgcaggtc aagaggacgc ctgggcccct tcctgaccca ggcagctctg caggagctgg 2820 gggagcattg acagtcaagg gcttttatag acatacacat tcagaccctc ggtgtccttg 2880 tccactgagt gctgtatctc atcgtatctc aatcttcatt gacagcactg gagaggctcg 2940 gggcaccact tggagccttg tatcccgcag gcttgagacc tgggggcacc tggcttacca 3000 aggttggcaa ggctccatct tcccttccag gggcttgggg atgctaagtt gctgtatcca 3060 ggaagacagt agtcctcaca tgcagaaggc ctgggaaaag gctgcttgtc gctttttttt 3120 tttcctttct ttttgtcact gggttagaga ggctcccgcc agcacagcac caaggcctgc 3180 ccagtagtca ggtctcctgg tccaggatgg gtgtgctctg tagttggtct gtgggtttct 3240 taggcccatg ccctgagcac attcacccag ccacatccct gctagtgaca ctcaggcagg 3300 ggcactggga accagacctg ggctctggca tcaacgagtg cctaggtgtc aggacagcgg 3360 cacccatcct gtacagggca gccaggatag tggccaagct actgtgtcct ttctccatga 3420 ggctccctgt cactcggtgc cagtgggtaa tgtgtgttct ttgagtcctt atatgaataa 3480 aaggctggag acctaccaaa aaaaaaaaa 3509 103 3044 DNA Rattus norvegicus 103 tgcactccag ccatccagcc atggctccct cacagctcgc gtggctgctg cgcctggccg 60 cgttctttca tctgtgtact ctgctggcgg gtcagcacct cggcatgacg aaatgcaaca 120 tcacgtgcca caagatgacc tcgccaatcc cagtgacctt gctcatccac tatcaactga 180 accaggagtc ctgcggcaag cgcgccatca tcctggagac gagacagcac agacacttct 240 gtgctgaccc aaaggagaaa tgggtccaag acgccatgaa gcacctggac caccagaccg 300 ctgccctgac tagaaatggt ggcaagtttg agaagcgtgt ggacaatgtg acacctagga 360 tcacctcggc caccagggga ctgtccccaa ctgccctggc aaagcctgag tcagccacag 420 tggaagacct tactttggaa ccgactgcta tttcccagga ggcccgaagg cccatgggga 480 cttcccaaga gccaccagca gcagtgactg gatcgtctcc ctcaacttcc aaagctcagg 540 atgcagggct tgccgccaag cctcagagca ctggaatttc tgaggtggct gctgtctcta 600 ccaccatttg gccgagttct gctgtctacc aatctggatc tagcctctgg gccgaggaaa 660 aagctactga atcccccccc actatagccc tatctactca agcgtccacc acttcaagcc 720 ccaagcagaa tgttgggtct gaaggccagc ccccatgggt ccaggagcag gactccactc 780 cagagaaatc tccagggcct gaggagacaa acccagttca tactgatatt ttccaagaca 840 gaggacctgg cagcacagtc catccctcag tggctcccac ctcctctgaa aagaccccca 900 gcccggagct ggtggcctcg ggcagccagg ctcctaaggt agaggaaccc attcatgcca 960 ctgcagatcc ccagaaactg agtgtgttta tcactcctgt ccctgactcc caggcagcca 1020 cccggaggca ggcagtgggg ttactggcct ttcttggtct actcttctgt ctgggggtgg 1080 ccatgtttgc ctaccagagc ctccagggct gtccccgcaa aatggcaggg gagatggtgg 1140 aaggcctccg ctacgtcccc cgtagctgtg gcagtaactc gtacgtcctg gtgccggtgt 1200 gagctgcctg cttgcctgcc tgtgtccaga gtgtgattca gacagctgtc tggggacccc 1260 catcctcata cccaccttca tccacgctgg gagaatggga atgcagaagc tagatcctcc 1320 aggggccatg tgctccaacc cccgaggagt ggccctggag gccaccctag accattattc 1380 acttatcaga gacagacaga gcaggtggcc ttccagctcc cccatatttg aaagaatcct 1440 ctgctgctgg ctggttagag gggcccttga catcccaaac tctatgagca attatttatt 1500 ggattcccag ccccctgaga cacccgtttc cccgtgcgca ccgtggtcca cccatcttac 1560 aagcagcagc caggcctctc tgcctgtccc ctgacctcct cgtgtctcct ggctttgctg 1620 cagtcgccag ccctgtctcc tccccggcca gctgcggtgc tatctatccc ggtctctctc 1680 tctcgtgtac agagccatca ccaccatcac caccaacaac tcgttctgtc tttgcttgca 1740 tgaggttaat gctgtgtttt ctggagctct ctgggacggg agatgagctt ctgtgagggt 1800 ttaaagtgtt cctccccaga ctttgatgtg ccgtgaagca tgctgcctct gaaggaaggc 1860 tctggtcccc actccgctgc cagcacaaca aagtgcccca cctgtaaagg aaaagagact 1920 cggcccagag ctggcaaaaa cccatggccc tgacatcatc actttctctg agatctttgt 1980 ccccatccct ggatgccacc cccccccccc agcccttatc aacatgaata gtcactgcca 2040 ttccactgga ctgacacttt tgtatgctgt gattctgagg gctggcaagg gatgacttga 2100 gagtgcagat cctacccatg ggcccccaaa tggaggctga gctggggacc tgcaggaaga 2160 gaggccaact cagaaggctc cgctgtgttc tcactggcac ccctcccctg tgcaccaagg 2220 tgacagtcac aggtctgccc tgcctgaaga acaagccaca gaaggaagat tatgacaagt 2280 ccctgggagg ccaaggatcc agggcaaacc ctggagtggc cacacaccca atttcagctt 2340 agggacttgt gcatgtgtgt acttgcacag cccagatcat tcagaagctg ccaggatcct 2400 ttcctacatc tgagagcgca gttcctgcca aggtctcacc ttcgcctcac ttcaggcagg 2460 gcagaactcc cataacattc tccaagagcc ctgtgacgtg ttctggaagg gactctgccc 2520 tgggcacaaa gtgtctactg aagcagagag cagcgcccca gccccagccc cagcgcccca 2580 gcgggagctg taccggcaga ccacgccctg ggggggaggg ccctccgtgg cagctttccg 2640 ctctgaatag ctccaaccgt cacctttgga gcctcccagg ggcgggcttc acccagccag 2700 tgactcactc cttgataggt ggaagctcag aacaggtggt ctcgtcccag agtgaggaag 2760 ccagcccctt ggcgaccctc ctcttgggaa gcttgtggga ggctctggtc tggctccaga 2820 gtactagttg taggcctgag gagcagcagc ccgagtgcac tatatcctgg ttcttcggtg 2880 gggagccttc aagggttggg acacccatgg ttggactttg ttggttcccg cgcctcgtgg 2940 gccaaaacaa tctgaggagg actttggaag gagttggttt ttcaagcatc attaccaatg 3000 tctgtgccat tttgtatttt actaataaaa ttttaaagtc ttgt 3044 104 684 DNA Rattus sp. 104 cacagttcag cttttattaa aacgccgtgt acagtgtggg gagctggaat gggctcctgt 60 gaaaatgata cgtttgaggg gattttcccg ctcttaccgt actgatcacg tggcaagtgt 120 tcttttacaa ggtatatggt ccctaacatc catgcagaaa atatcaggtc aagtgcttta 180 ctgtcccagt gtccagtctg gtgacccata tcagaggcag ggcattcgca gagatattct 240 cattccgtgt cttgagtttt ctttcgaaga aagtatttta cgagtcatat acaaattaaa 300 caagaagtcc aacgagggcc ctttattacc accaaggggg ggaaggggta acttacaatc 360 ttataaccaa aggcaaaaac agcacaagaa cagtttattc ctgcagatgc cgcagccctg 420 tgcaatcccg cctcggctgc tgctacagcc agaagtcaga gccccgcagt gaaggctgca 480 cgggaactct aggctggatt acccacacag aacaagtcgc tcctgccccc gtacgttgga 540 agtgctgctg gattttctgc ttgctcagtg gataacagtg cagctgaaca tgaggggctt 600 aaatagcacg ctcgcgcgca cacacgcatg catacacaca cgtgcacaca cacacacaca 660 cacacacaca cacacacaca caca 684 105 3471 DNA Rattus norvegicus 105 ggcacgagga ccggctgagg attttatggt tcttaagcgg acttaagagc gttgtttcgg 60 attgttaaga ttcccgtttg ctgggttttc ctccctcaat cgtgctctcc cgcggctgcc 120 tggggactgg ctcggcgaag gaggatggag agggggctgc cgttgctgtg cgccacgctc 180 gcccttgccc tcgccctggc gggcgctttc cgcagcgata aatgtggcgg gactataaaa 240 attgaaaacc cggggtacct tacatctccc ggctaccctc attcttacca tccaagtgag 300 aaatgtgaat ggctaatcca agctccggag ccctaccaga gaatcatgat caacttcaac 360 ccacatttcg atttggagga cagagactgc aagtatgact atgtggaagt gatcgatgga 420 gagaatgaag gtggccgcct gtgggggaag ttctgtggga agatcgcacc ttcacctgtg 480 gtgtcttcag ggccatttct cttcatcaaa tttgtctctg actatgagac ccacggggca 540 ggattttcca tccgctatga aatcttcaag agagggcccg aatgttctca gaactataca 600 gcacctactg gagtgataaa gtcccctggg ttccctgaaa aataccccaa cagcttggag 660 tgcacctaca tcatctttgc accaaagatg tctgagataa tcctagagtt tgaaagtttt 720 gacctggagc aagactcaaa tcctcccgga ggagtgttct gtcgctatga ccggctggag 780 atctgggatg gattccctga agttggccct cacattgggc gttactgtgg gcagaaaact 840 cctggccgga tccgctcctc ttcaggcatt ctatccatgg tcttctacac tgacagcgca 900 atagcaaagg aaggtttctc agccaactac agcgtgctgc agagcagcat ctctgaagat 960 ttcaagtgta tggaggctct gggcatggaa tctggagaga tccattctga ccagatcact 1020 gcatcttccc agtatggtac caactggtct gttgagcgct cccgcctgaa ctaccctgaa 1080 aacgggtgga caccaggaga ggactcctac agggagtgga tccaggtgga cttgggcctc 1140 ctgcgattcg ttactgctgt ggggacacag ggtgccattt ccaaggaaac caagaagaaa 1200 tattatgtca agacttacag agtagacatc agctccaacg gagaggactg gatcaccctg 1260 aaggagggaa ataaagccat tatctttcag ggaaacacca atcccacgga tgttgtcttt 1320 ggagttttcc ccaaaccact gataactcga tttgtccgaa tcaaacctgc atcctgggaa 1380 actggaatat ctatgagatt tgaagtttat ggctgcaaga taacagatta cccttgctct 1440 ggaatgttgg gcatggtgtc tggacttatt tcagactccc agattacagc atccaaccaa 1500 ggagacagga actggatgcc agaaaacatc cgcctggtga ccagtcgaac cggctgggcc 1560 ctgccaccct caccccaccc atacatcaat gaatggctcc aagtggacct gggagatgag 1620 aagatagtaa gaggtgtcat cattcaaggt gggaagcacc gagaaaacaa agtgttcatg 1680 aggaagttca agatcgccta cagtaacaat ggttctgact ggaaaatgat catggatgac 1740 agcaagcgca aggctaagtc ttttgaaggc aacaacaact atgacacacc tgagctccgg 1800 gcctttacac ctctctccac aagattcatc aggatctacc ccgagagagc cacacatagt 1860 gggctcggac tgaggatgga gctactgggc tgtgaagtag aagtgcctac agctggaccc 1920 acgacaccca atgggaaccc cgtggacgag tgtgacgatg accaggccaa ctgccacagt 1980 ggcacaggtg atgacttcca gctcacagga ggcaccactg tcctggccac agagaagcca 2040 accattatag acagcaccat ccaatcagag ttcccgacat acggttttaa ctgcgagttt 2100 ggctggggct ctcacaagac attctgccac tgggaacatg acagccacgc gcagctcagg 2160 tggagggtgc tgaccagcaa gacggggccc attcaggacc acacaggaga tggcaacttc 2220 atctattccc aagctgatga aaatcagaaa ggcaaagtag cccgcctggt gagccctgtg 2280 gtctattccc agagttctgc ccactgcatg accttctggt atcacatgtc cggctctcat 2340 gtgggtacac tgagggtcaa actgcactac cagaagccag aggaatatga tcaactggtc 2400 tggatggtgg tcgggcacca aggagaccac tggaaggaag ggcgtgtctt gctgcacaaa 2460 tctctgaaac tgtatcaggt tatttttgaa ggtgaaatcg gaaaaggaaa cctcggtggg 2520 attgctgtgg atgatatcag tattaacaac cacattcctc aggaggactg tgcaaaacca 2580 acagacctag ataaaaagaa cacagaaatt aaaatagatg aaacagggag caccccagga 2640 tatgaagaag ggaaaggcga caagaacatc tccaggaagc caggcaatgt gcttaagacc 2700 ctggacccca tcctgatcac catcatagcc atgagtgccc tgggggtgct cctgggtgca 2760 gtctgtggag ttgtgctgta ctgtgcctgt tggcacaatg ggatgtcgga aaggaaccta 2820 tctgccctgg agaactataa ctttgaactt gtggatggtg taaagttgaa aaaagataaa 2880 ctgaacccac agagtaatta ctcagaggcg tgaaggcacg gagctggagg gaacaaggga 2940 ggagcgcggc aggagaacag tggaggcgca gggactctgt tactctgctt tcactgtaag 3000 ctgggaaggg cggggactct gttactccgc tttcactgta agctcggaag ggcatccgcg 3060 atgccatgcc aggcttttct caggagcttc aatgagcatc acctacagac acaagcaggt 3120 gactgcggta acaacaggaa tcatgtacag cctgctttct tctcttggtt tcgtttgggt 3180 aatcagaagc cagttgagac caagtgtgac tgacttcatg gttcatccta cttggccccc 3240 tttttcctct ctttctcctt accctgtggt ggattcttct cggaaactgc aaaatccaag 3300 atgctggcac taggcgttgt tcagtgggct ctttcgatgg acatgtgacc tatagcccag 3360 tgcctagagc atattagcat aaccacattt caggggacac caatgtccgc ttttgcatcg 3420 ctacgtgcag cgagcacagg aaaaagaaaa aaaaaaaaaa aaaaactcga g 3471 106 528 DNA Rattus sp. misc_feature (1)..(528) where n may be a or g or c or t/u, unknown, or other 106 gggaaggaga gggtttattt cagcttacag ttcaacatca cagttcatca ctgagagaaa 60 acagggcaga agctaatgct ggggtcacag agagtgcatc tgtgtagggg atttggcagg 120 tagcctgcag tctggtcact cttgaggaca gcattttgct agctcagcca cccagccaca 180 actcagccgg gcccaggaag cgacccagcg tcattgctat gggtcgttta cacaaaccac 240 accagcatca attaacactt ttttctgtac ctgatgccac agtttaaaga aaaaacaaaa 300 caaaacaaaa caccaacaca cgactataca tcatcacagt tgggcagccc atgggtgagg 360 tttccacagc tactccacaa aggtacccat cctgcgtcag gagatccagc accttgccac 420 agactgcagc tcacaggtgc ctgtacactg aatattttgg tctaaaattt gacctttatg 480 ggatggggag agcnctcttt gttcttaaac ctggatatcc ttcaagtc 528 107 437 DNA Rattus sp. 107 gaagtaaact aaatttatta tcttcttaca agaatgcttt agttgtttgg ttttgtttca 60 agtcctgtgc cttttaactt tcaccataaa gtattatttt ttacatgtcc ttgacaaagg 120 cttgataagc agttcaactc tttgggcaaa gtcactagct cttatccgaa ttagaataca 180 ggacacagaa taagtcaaga caatccatga catttttatt gagcctttag tttatgaggt 240 aaaggaaaag cagctctctt gaatacttta catagctgta ggaaaatatt tgagacttaa 300 agggagttac aggggattat gtttctcaaa agttgagatc aagtaaacaa gctttaaatt 360 gtttaaattt tccagttgat ctttccttac aatagtaaca agctcgaatt agccacatta 420 ggtttttatg tttgcag 437 108 385 DNA Rattus sp. 108 tgtgtttggt aaagggccct gggtgaggat atagctcctc tgggcttcct ccatttttat 60 caagctggca ttgatatagt cattatcttc ctgatgcaat ttaatccgac tgtggtcaaa 120 agggctgaca tctcggtacc tgttccggtt tttgttctta ggaagtttcg ctattctgca 180 tgggaagtca ctggcttcat gtcgaatatc ctggtaaata gccgcccagt tcccagcctt 240 atcgatctgc tcgaattcct tctccatttc catggcggga cgggacgact gcccgagttc 300 cctcggccgt ccgccgcgct aggccgcgtc gctccgcgtc tcgaggcccg tcgcccgcca 360 cctcgtgccg cctcgtcctc gtgcc 385 109 646 DNA Rattus sp. 109 attttttttc tatttttttt ttttattatg gtttacagct actttattta caactataca 60 tttaacacaa tgagataaac actgatagac tgaaggacta aggtttgcag ctactttatt 120 tacaagtata catttaacac aaggagataa acactgatag actgaagcct aactaatagt 180 actgtaacgt gtaccatttt gatgactaca ttattttaaa caacaaacta cactgaaaaa 240 ttaatgccga taaaattctt ggtcataata ttaagaaata caatatataa attgaaaata 300 tgattgttta aaatttgaaa atggaagtga actcatttgg acagagtcag acgttaacat 360 aatctgagag gggaggacct ctgacccaaa tgacatcgtt caggttaaca gaacaaaaca 420 gaagcctagt tttatcttca aggatgacag gcagcttgct tcctcaggtg gaatacactc 480 aggctctgca tcacgcgcgt cacatcaccg agcacgaaga cttcccttgt gaagccgctc 540 cgtcaatctt ttctgcttcc aaaattatcc ttcgaaaaac atccacagca gtctgatttt 600 cttttgcaga agattccaaa aaagctgcat tccaagattc tgccaa 646 110 484 DNA Rattus sp. 110 atagaaagag atatttaata cttttaaaaa aaaaaaaaat taggagtaaa gtccacttca 60 caggctggag gacagtgttg cccatctagg gggcatgggt gtggccccac gtaccaggga 120 caagccacgg gttgtggggc agctgtagag aagagtgtaa ctgagtccag ccaaagcctc 180 cctcatgatg cagagccgcg gctgtggcca gcctcgcatg ctggttgaaa agggacctgc 240 caactgcctg tggccagctg tagagtcaca tgagcaggta gcagtggagc tggctcaggg 300 gaaggtgcca gggacctggg aatgtccact cttcagggcc tttgtgcttg gtcttcaaca 360 gggctgagct gggtctgtgg tcccatggga agtgtacacg ttcatgcctt gtgcacttct 420 gcatgagcag caggtgccag gtgtctccat atgctgagct ggctgtgctt gcagcctcgt 480 gccg 484 111 711 DNA Rattus norvegicus 111 gggactcgca cttgcaatat gactttggag gaattctcgg ccgcagagca gaagatcgaa 60 aggatggaca cggtgggcga tgccctggag gaagtgctca gcaaggctcg gagtcagcgc 120 accataactg tcggcgtgta cgaggcagcc aagctgctca acgtagaccc ggacaacgtg 180 gtcctgtgcc tgctggctgc ggatgaagat gacgaccggg acgtggctct gcagatccat 240 ttcaccctca ttcgtgcttt ctgttgcgag aacgacatca acatcctgcg ggtcagcaac 300 ccgggtcggc tggcagagct gttgctactg gagaacgaca agagccccgc tgagagcggg 360 ggcctggcgc agaccccgga cttacactgt gtgctggtga cgaacccaca ttcatcacaa 420 tggaaggatc ctgccttaag tcaacttatt tgtttttgcc gggaaagtcg ctacatggat 480 cagtgggtgc cagtgattaa tctccccgaa cggtgattcc ccgaacggtg atggcatctg 540 aatggaaata actgaaccaa attgcactga agttttgaaa tacctttgta gttactcaag 600 cagtcactcc ccacgctgat gcaaggatta cagaaactga tgtcaagggg ctgagttcaa 660 ctacaggagg gctaggagat gactttgcag atggacagag aggtgaaaat a 711 112 608 DNA Rattus sp. misc_feature (1)..(608) where n may be a or g or c or t/u, unknown, or other 112 cggccaactt gatgttctag tgctgaaggg agcaagggcc aggcatgtgg tggagatgat 60 gctgaaatgg tttatccaat accatgcaaa tcaagtcctt tggatagagg tgaagaactt 120 ggacatggct gtttcaggca gctgaagtca aaaggaccgg gactggggag ggcagggacg 180 ctaagtgaaa gggctggtgg ccaacgggcc ttctactgac tagaaggact tggctctgag 240 atcttccatc tgacctaaga ggcagcccct cttgcgggcc atctcttccc tcctgtcctt 300 tattgtctct cgcactcccc tccctttgct ctctttcatt ccctttagca aatttcaatt 360 gtcctaggag aaaaggttgc tgtcatgtct gaaagacccg ttgaggtgct gaaggagttg 420 ctgacccctg cagactggaa ggtgctgacc tgggcaggga aagcaggtgg gacggaggca 480 taggtggtgg ccaccgaggg cgatgggaag ccactgtgcg caggagacgg gtaggtagag 540 gagcccggag aggagtaaga ggtgggcact ggggatggaa atgaggtggt ggcggnggat 600 gggttaga 608 113 539 DNA Rattus norvegicus 113 cctcggcctc cgggctgcgg gaacgcccgg actgggaacg ccacgtcgag aggcgttcgc 60 ggaaggcgcg ggatccagga cgtgctggtc acccccaaac cccaggccac ccattatcgc 120 cttggttcgc ccatcagagt tgtaagaaaa tggcagacaa gccggacatg ggggaaatcg 180 ccagcttcga taaggccaag ctgaagaaaa ccgagacgca ggagaagaac accctgccga 240 ccaaagagac cattgaacag gaaaagagga gtgaaatctc ctaaaagcct aggaagattt 300 ccccacccca ccccttcatc tccaagaacc ccctcgtgat gtggaggaag agccacctgc 360 aagatggacg cgagccacaa gctgcactgt gaacccggca ctccgcgccg atgccaccgg 420 cccgtgggtc tctgaagggg acccccccac taatcggact gccaaatttc accggtttgc 480 ccagggatat tatagaaaat tatttgtatg attgatgaaa ataaaacaca cctcgtggc 539 114 292 DNA Rattus sp. 114 gcccgttgac cagatccact agaactgtct gcattatcta tgcagcatgg ggtttttatt 60 atttttacct aaagatgtct ctttttggta atgacaaacg tgttttttaa gaaaaaaaaa 120 aaaaggcctg gtttttctca atacaccttt aacggttttt aaattgtttc atatctggtc 180 aagttgagat ttttaagaac ttcattttta atttgtaata aagtttacaa cttgattttt 240 tcaaaaaagt caacaaactg caagcacctg ttaataaagg tcttaaataa ta 292 115 2299 DNA Rattus norvegicus 115 ccactgcagc ttttctctgt tgcttaaggc tttcggttac ttctctttct ctaagcgagc 60 gttctttgtt ggtgacccga gttgctagtc cagaaggaac agactgagtg agcgaggcgc 120 catgacaacc ctggatgaca agttgctggg ggagaaattg cagtactact acagcaccag 180 tgaggatgag gacagtgacc atgaagacaa agacagaggc aggggagccc cagccagtag 240 ttccacgcct gcggaggctg agctggcagg cgaaggcatt tcagtcaata caggtccaaa 300 aggggtgatc aatgactggc gccgcttcaa gcagttggag acagaacaga gggaggagca 360 gtgccgggag atggagcggc tgatcaaaaa gctgtctatg agctgcaggt cccatctgga 420 tgaagaggag gagcagcaga aacagaagga cctccaggag aaaatcagtg ggaagatgac 480 tctgaaggag tgtggtatga tggacaagaa tttggatgat gaagagtttc tgcagcagta 540 tcggaagcag aggatggacg agatgcggca gcagcttcat aaagggcccc aattcaagca 600 agtgcttgag atccccagtg gagaaggatt tttagatatg attgataaag aacagaaaag 660 cacccttatc atggttcata tttatgaaga tggtgtccca gggactgaag ccatgaatgg 720 ctgcatgatc tgccttgccg cagagtaccc cactgtcaaa ttctgccgag tgaggagctc 780 ggttattggg gccagcagtc gttttacccg gaatgccctt cctgctctgc tcatctacaa 840 ggcgggtgaa ttgattggca attttgttcg tgtcactgac cagctgggcg aagatttctt 900 tgctgtagac cttgaagctt tcctgcagga atttggattg ctcccagaaa aggaagtctt 960 ggtgctgaca tctgtgcgaa actctgccac ctgtcacagt gaagacagcg atctagaaat 1020 agattgaact gataatccag ttctgtagct gtctcattgt ttgggctaga ggacacatgt 1080 ctgtatttat ttctgtcctt cctgtcttct ggctttacag ctgctctttg tagtctggtt 1140 tagtatgtgg aaagtcaaga aactcagatt aaatcagaat cctgactcac tttgtggcta 1200 gcagtaaagc gatttctaat tatatagaca ggaagctggg ttcttgagct gtttacatct 1260 ctagcgtgac atctctgaaa ttgtttccag tcaatattga catggcaccc ttgaaggcaa 1320 tgtcttgaaa attgtcttct gatgacctca gaattccacc aggtctgaga gtagaattcc 1380 ctagtgagtg tgtttctgtg cagtgtaaac agtgcatttc cataatcact tgattgcaaa 1440 tcatgtttac ttgcaatcag actgtactta ttttctccag atccttttct accagggtcc 1500 atacaaggtt gggaatatag ctcagtggtg aactcttaac ctagcataca cgtagctcca 1560 aggtatctga ttcccaccca gcactgtcaa aactaaaggt tccagatggt gcctgatgct 1620 gtagttagca gtggtctctg ctgagggagt cactggtcca gtccagcata atgaagtatg 1680 aacaggaagg gatgttttct tgctgactgc tgtcatgatg ttgggaagca ggcatcctat 1740 tgactgacac tggtgtgtat agagcttgaa aacctcagta ggaggacctt tctgtatcct 1800 gctcatcagc atcttttcct ctgttattta gcaaggtaat gctttgttta acctttttaa 1860 ctttttaaaa agcttttttt gcctattaga aaaaaattca tatttactag aggaaaattg 1920 gccaaaatac agatgcaaaa atggtttaaa aatataattc tgccacctga atactgtatg 1980 ttttggtacc attcatatat agggtttttt ttttaatgta ttctagggtt ttttttttat 2040 ggattcatag gctttggttt gtctttaaat gacattggga tcattctgag catactgtca 2100 gatagtgtga tcagttacta acagaagttt ttcatgctat taaatactct tctgtaatat 2160 tttaataact gttgatattc cattgatttg ctgaaatctg gtgtttgggt tttagaaaga 2220 tagcaaactt ttttattata aactttctat tatgaaacat gtttatcata gaacatacaa 2280 taaaaattaa gttaaaaag 2299 116 1739 DNA Rattus norvegicus 116 ggaggacggc acgtcgcggg tggcattgtg tgtcccagtg tgcaagaata gccccagaag 60 aggaaaggct gagcccagag cgcttcagca gggaagattc ccttcccccc gcttcaggct 120 gctgagcact gagcggcgct cagaatggaa gccatcgcca aatatgactt caaagctact 180 gctgacgatg agctgagctt caagaggggg gacatcctta aggttttgaa tgaagagtgt 240 gatcagaact ggtataaggc agaactcaat gggaaagatg gcttcatccc caagaattac 300 atagaaatga aaccacatcc gtaagtcaga ctctatggtt cccctaagca aggggtttgc 360 ctgcttgtct ccagcactgt tttgatttgg ttttgggggg atgatgtaga aggctcttca 420 ctatataggc cttgttcccc taagctgtag ccagcctgta gaccaggctg gcctcacaga 480 gacccacctg cctctgcctc cctactgttc cagcatgccc agattggttt ggttttgtga 540 gataggctta aaagtcttaa taaagacatc aaaatcaaaa taataatatt gaagaaatca 600 aaatcaattt aaatggcaaa tatatgttag attactggac ctctgtcctt ggttcagaat 660 gccccttgcc tgcgatctct gcagcctaag attgggagcc agcgtacagc cagtccaggt 720 tactcagagg cagactcgaa tggggataag tccgaagaag ctttgagact ccaagcatga 780 agtcctcaca gcttcctgtg gtgggataag cccacttggc actcctacag ccatttagcc 840 ttgggttttt tgtcttttag gtattttttg ctttagggag ttttgtgggc ttttcacaga 900 caaaacagta atacgctggt ggagaagcac gaagcagtga ggatgacccc tagatcagac 960 ctagactcag cctgtgcatc tcaccaagag tccctacgcg tctgtctcct tcatccatgc 1020 tcaaaaagct tctgtgccgc atggtatcac agaccgtgtg ggtcccagct cctaggaggc 1080 agaggcaggg gatccctggg agtcgggacc agcctggtct gtagtattga gattagccag 1140 gactgcatgg tgagaccctg tcttaaagaa aagaaagaaa ggatggaggg aagcaaggag 1200 ggagaatata aaagtgtacg tgggaaaaca caggctggag aggtggctta gtggttagga 1260 gcattgtctg ctcttccata gaatttgggt ttgagtccta gcaccaacat catggctcac 1320 agccatctgt aatttaagtt ctagagaccc aagtcctctt ctggccacca aagtcacaga 1380 catacatgca ggcataacac tcatgtatat actttttatt ttttaaataa gaaagaaaag 1440 cagaaggtca ggagacagtt tgtaaaggag gaagttttct ttctttctct ccctttgttt 1500 atttcttact tggtcaaggc cttactatat ggctctgact agaacttttg ttgaccactc 1560 tggccttaaa ctcaaagatc tgcctgcctc tgcctctgcc tctgccttcc aagtgtgtgt 1620 attattaaag gtccatgcca acacacatgt cctgccttct tttaaaaagt aaattgtggg 1680 ggttggggat ttagctcagt ggtagagcgc ttgggccctg ggttcagtcc ccagctccc 1739 117 2376 DNA Rattus rattus 117 cgcggccgtc cgctctccca actcgcagcc agtcggcgcg tcccgcctac tgagcgcagc 60 ctccaccagg atccgcgggg accagctcgg gatcagccgg cgacccactt ctgaccaacc 120 caggagcggc ccgataccca ctcccgacca acccgcgacc gacccaggga cccactccgg 180 acctgctcct tacaggggac agcgcctcgc cgcttcccgc cgcccagcgc ccgcacgctc 240 ctcgggacac agtgccaacc atccagagga caagatggat tggggcacac tacagagcat 300 cctcgggggt gtcaacaagc actccaccag cattgggaaa atctggctca ctgtcctctt 360 catcttccgc atcatgatcc tcgtggtggc cgcgaaggag gtgtggggag atgagcaagc 420 cgattttgtt tgcaacactc tccagcctgg ctgtaagaat gtgtgctacg accactactt 480 ccccatctct cacatccggc tctgggctct gcagctgatc atggtgtcca cgccggccct 540 cctggtagct atgcacgtgg cctaccggag acacgaaaag aaacggaagt tcatgaaggg 600 agagataaag aacgagttta aggacatcga agagatcaaa acccagaagg tccgtatcga 660 agggtccctg tggtggacct acaccaccag catcttcttc cgggtcatct tcgaagctgt 720 cttcatgtat gtcttttaca tcatgtacaa tggcttcttc atgcagcgtc tggtgaagtg 780 taacgcctgg ccttgtccca atacagtgga ctgcttcatt tccaggccca cagaaaagac 840 tgtcttcacg gtgttcatga tctctgtgtc tggaatttgc atcctgctaa acatcacaga 900 gctgtgctat ctgttcatta ggtattgctc agggaagtcc aaaagaccag tctaatgcat 960 tgcctggctg ttaagcaaag atgagggaga ggatgaggca acctgtgctt agttatcaga 1020 gttcagctac cagcatctcc cgggcaaaca ttcccacctt aaatgccgcc atttgaagtc 1080 ccccgcaggc ctcccatgaa actccagaag cctccatggg cctcccttcc cccaaagctc 1140 ccaaacaaag gcccaattct atgcctgtat taatgggttc taaagttagt tagaccccgt 1200 gctggtgtga ctatgcttta ggatacattc acagtttaaa caaagggatc tcacattgtt 1260 tctcttcctc tgaggacagg agacatgagc ccagtcctga ggaaggtaca gagaaagttc 1320 cttcttccgg gtccccttcc ccaagttgcc cccagttaag ggtaaagaat cttcgttctg 1380 ttattttctt tcatagttta agtttgcaac aatggacaaa agctatttaa tgttcaagct 1440 agctgtgtcc tttttttttt ttttaaatga aaaccttaaa atgataggtt cttttgttct 1500 taaaatgatc tggaaagcat tatacattcc tcctatttca gaggttcggt ttgtgatgtg 1560 agcatggtgt ataaccagat ctcacaaggt ctttaaaacg ttggcctttt ggttatggga 1620 aacctgggct gtggctgaga gcccacctac tgtattcatc cttaggtgtg ctgagtacag 1680 cccgcaacaa cgttacagcc tgtctcaaat gagacaaact ggaagcttct cgtgttagct 1740 tctgacaaga agaggccttg attaaaattt tcaaccgtaa ttttgtgtaa gaggcagata 1800 ggttatgcct acaactgccc cctgccatga gcctaactca gcccccctcc acccccagct 1860 cgtctactct gtagctgtgg gatgtggcag tcagtatcaa aagacttcat gagtttgctt 1920 gggaatttca ctgccatggt acaatttaat ggtgcagaaa caagatgggg tggttttcaa 1980 agaaccgatg aaacttctag actctaaatc ctgttgatta aaactgagtt tttctacttt 2040 gaatgtctgt ttgcctccct tttcagcatt gccttctaaa ctggaaacag aaatgttgat 2100 atttggaaaa aatagaagaa actagtttag gtcaatgtgt aacttttcta ggacaagttg 2160 aaccttagca ttgtcattct gcctgatgtg ttgtccacaa gatgacagtc aacaaatcca 2220 acaggggaca cttcttcctg ccaagaatgt cgttgggaag ccattctgta acaataaata 2280 agagttgtgg tttaaagtct acactatttt acctaatgaa gaacttattg ctgatgttca 2340 gaaattcgac attgaaaggt gttttgccaa tacggg 2376 118 623 DNA Rattus sp. 118 aacggtgtca taaataagta atataacttt attaaaatga aaagacaata ttcaaaataa 60 tgcaacaaaa tgaacaaatc ctttgtccaa tactgtacac acagtgcgga gatcagtgca 120 ttttctaaag catgttttaa ccttcattta gttcatacta aaaataagct ttaaatagct 180 caaataatgt cattcagcag tttaaactga acagcttgtt gggacatggc agcggtgtcc 240 ctctgttagc aagcaccttc tctttgtgct tatctataca agataaacaa tcagaggatg 300 taaaaattga acacaagcta cacgtctcac tgactctcag ggcagtgagg cagccagctg 360 tgagttttct aagcaggaag atgctgaagt gacctctggc attaagacgt tctgtgctat 420 tggtcagaag tgtttcactt aaaaagcaaa caatccccag gaaatactga ataggaacca 480 acaacacaag accagcttgt gttgtaattt aaagtctcaa aaacaaaaca acatcccata 540 caccccacga taactctcaa tcatggctaa tcactgggct ggtctataaa acttataccc 600 aacactgacg gccagcacca ctc 623 119 365 DNA Rattus norvegicus 119 ttgccttaaa tgttttatta caattaaaat ttcacaaaca cagatcaatt aatctcaatc 60 aaaataactc atgtttacat catatttata gacaagctgt acaataaata cataaatgca 120 ttcacagtct gtgcttcagt catcctcctc ctcctcttca tcgctgatca catacttctt 180 atgcttttta ttggccttga catcgtcctc tgctttcctc ttcccagaag actcaccttc 240 ctcgtcactg ttgagcttct ttgctttgag cagcctctga gctttgtctt cttccgagcc 300 ctcgtcgctg tctgaggagt agattctggc ccgttcctcg cggatgcctc ctttgtagcg 360 gtttt 365 120 1095 DNA Rattus norvegicus 120 aaattgcaag cgtattcttt taatgactcc agtaaaatta agcatcaagt aaacaaaagt 60 ggaaaagaac ctacactttt aacttgtctc actagtgcct aaatgtagtt taaaggctgc 120 ttaaattttg tgtgtagttg gattttttgg aagctgaagg tatccatctg cagacattga 180 ggcccaagtt gaatttggat tcgagtggat tcttaacact tctgcctgtg ctgaagagaa 240 gcttcataag gaacaagcaa gttgaataga gaagatagtg atcaataaga ggcatttagt 300 ggtcttttta atgttttctg ctgcgaaaca tttcaagatt tattgatttt tttttttcat 360 tttccccacc acactcacac acgcacgctc acacttttta tttgccataa tgaaccgtcc 420 agcccctgtg gagatctcgt acgagaacat gcgttttctg ataactcaca acccagccaa 480 tgcgactctc aacaagttca ccgaggaact caagaagtac ggagtgacaa ctttggtccg 540 agtttgtgat gctacctatg acaaagctcc agttgaaaaa gaagggatcc acgttctaga 600 ctggccattt gatgacggag ctccgccccc taatcagata gtagatgatt ggctaaacct 660 gttaaagacc aaattccggg aagaaccagg ctgctgtgtc gcagtgcatt gtgttgcagg 720 attgggaagg gctcctgtgc tagttgcact tgcactgatt gaatgcggaa tgaagtatga 780 agatgctgtt cagtttataa gacaaaaaag aagaggagca ttcaattcca aacagctgct 840 ttacttggag aaataccgac ctaagatgcg attacgctcc agagatacca acgggcactg 900 ctgtgttcag tagaagcaga ggaaggccgg ctggatcgtg gcattagagg gaactctggg 960 tacctggaaa tgtgaatctg gattcttacc tgtgtcatca aagtagtgat ggattccgta 1020 ctcctcgact cctcatgatt gagaagaagg caaacgataa agaaatccct ctataacacg 1080 aataaaatgt ttaag 1095 121 516 DNA Rattus sp. 121 aatacaagta aaagggggca gggcaactcc ttccccctcc aggtcaggac caggagaatc 60 tgctgggctg tccccgggac caaagaggaa aagagtgaca tagaaactga agcaaaggaa 120 gcttagtcac actcaggtga gggtgacagc tcctcctgga ttttgtttcc atttattaaa 180 aaaaaaaaga aaagaaagaa agaaaaagcc accccctcac tcccagccca ttcctcacag 240 ccagggtcag aaagcagcat cagtgaggcg ggttcctcac ctctggttat ctctggccca 300 ggtcagcttg agccacctgc cctcaccagg agagggtttc agttggcagt taggcttggg 360 gaagtctcta cctggacccc ccagaggcct gggagcaccc ccctcctccc aggaaaggga 420 atgcagtgtc tactgggctc agaggggtgg cctcacccac ctgacatgag tcctgattct 480 cccatctcga ggacggcagg aagtttattg caccag 516 122 484 DNA Rattus sp. 122 aaaataactg catcctttaa tggcagtaat acaattactg gattaagaga ccacatgaga 60 aggcaggtga ggtttctgga agacagatac tgagtacaaa aggggcggga gagccatgcg 120 acgatcattg taaaaataca gtacgttata tacatatttg caccatcgac tttcaactca 180 gaaatactat ttacacgttt gttacaatcc tggttagaga acaatctttc tttacgagtg 240 tggcctgacg gcaagtgggg attcaagcgg atgtccttgt tccatgaggg ccttcagtta 300 actgttccgg aggaaggctc cagactccac ctcacccaca gatttcagcc ttaaaaattc 360 atttaatttt ttctttagca aaaggacagc ttttttctaa tatagaccag gctggctttg 420 aacctgtgat cctcctgccc ctcctcctga gtgccgggat gacaggcgca ctaccaagcc 480 tggc 484 123 278 DNA Rattus norvegicus 123 ttgcaagctt tctgagcttc tgcattcaga ccacacagaa catgtaaata tttatacaca 60 gaccggagaa ctagcacacg ccttacagcc tgtgtgtccc ctgccttcct ctctcttctc 120 tcttaagaag tgcggagtgt ttttttttct tttttgcgga tttgcctaga ggtgccccag 180 gcccccgctc ctagagaggg tgacctttct gtttctcctt ctccttgttc tccagagggg 240 caatgtgagg gtgatggagc cgggtccctt ctggtgcc 278 124 569 DNA Rattus norvegicus 124 tttttttttt tttttttcca aaggcgcata aaaatatatt tcaactttaa aaataactta 60 gttacagtaa taccttgctt gtttttaacc aacatgtagc tgacagtcaa acttttgcaa 120 cacggaaata gtatacagag atatataaga accacaagga aaaaaaaatc ccaagtcccg 180 taaggttgtt caaatatgaa aagagtaaaa tcttaacact ggagaacttg ctatggtgaa 240 cccaaaagat acaatataca aaagagtcaa ggaaaaaaaa aaggcgtagg tgcgaagtaa 300 gttttgattt ccttcttcca acatgctgta aatcgccttg gagagaccac ggacccagcc 360 ctactagaca gcctggatgt gtgcacatct acaaacaaac aaaaccaagt gtgtttgagc 420 accgacaatg cctgcttaaa tgagctctga ctcacttcct catttccttt tttctgccag 480 aacaggtttt ttttttttct tcaattttta aaaattgttg ttttcaaaat ttttgcaaac 540 agggcatgat ggttaaaaaa atattttgt 569 125 486 DNA Rattus sp. misc_feature (1)..(486) where n may be a or g or c or t/u, unknown, or other 125 aaagccatat gctacacata taacaccttc tctctctgta aggttgacat tattcagaga 60 gacagtcaga ccacaaacca caaaacaaca gtcagacaac aagcacttta cagcgcccac 120 cacagtctgt acactaagaa cagtcctaag acggcatccg ccttctcctt cttcaatcag 180 gatgaagctg tcactgacat tccagtaaca tgctgacgtg gttgtcattg tttcaagtca 240 caccaaactg acatcagaga gtacaaggtt ctttaatcaa acaaaaaatt agtcttattt 300 aaattaagag aaattcactt aaactaaaaa taatttattt gagctgagag ttttaaatac 360 ctaacacatt ttttcaggga aatgttggta ccatctctga gccagctttc cttcctccag 420 actgtatctg tataagaata tataatttac ttatctctca gtcattttca gannacacgt 480 atctct 486 126 2452 DNA Rattus norvegicus 126 cggaggcgca tcggcactgc ctggttctgg gggctggggt agggaagagt ggggctcact 60 cctacaccca ccgatgaagt ctgtgagcct cctgatccgg tcttgctgag agtgtgggcc 120 tatgcccctc ccatgtggga ggtgggatag aaagtccctt tcctcagcct ctctgagtgg 180 gtttggtatc tcttcgaaag ggtgaggtgg ctttgacccc gggttgcccg ccagcgcgac 240 cgaggaggtg gctggacagc tggagaatga acggagaagc cgactgtccc acagacctgg 300 aaatggccgc ccccaaaggc caagaccgct ggtcccagga agatatgcta actttgctgg 360 aatgcatgaa gaacaacctt ccatccaatg acagctccaa gttcaagacc acagagtcgc 420 atatggactg ggaaaaagtc gcatttaagg atttttcagg agatatgtgc aagctcaaat 480 gggtggagat ttctaacgag gtaaggaagt tccgtacttt gacagaattg atccttgatg 540 ctcaggaaca tgttaaaaac ccttacaaag gcaaaaaact caagaaacac ccggattttc 600 caaagaaacc tctcaccccc tacttccgct tcttcatgga gaagcgggcc aagtacgcga 660 aactccaccc tgagatgagc aacctggacc tgactaagat cctgtctaag aaatacaagg 720 agcttccaga gaagaagaag atgaaatata ttcaggactt ccagagggag aaacaggagt 780 tcgagcgaaa cctggcccga ttcagggagg atcaccctga ccttatccag aatgccaaga 840 agtcggacat ccccgagaag cccaaaactc cccagcaact gtggtacacc catgagaaga 900 aggtgtatct caaagtgcgg ccggatgaga ttatgcgtga ttatatccag aagcaccctg 960 agctaaacat cagcgaggaa ggtatcacca agtctaccct cactaaggcc gaacgccagc 1020 tcaaggacaa atttgatgga cgacccacca agccacctcc gaacagctac tccctgtact 1080 gtgcagagct catggcaaac atgaaggatg tgcccagcac agagcgcatg gtgctatgca 1140 gccagcagtg gaagctgctc tcccagaagg aaaaggatgc ttatcacaag aagtgtgacc 1200 agaaaaagaa agattatgaa gtggagctgc tgcgtttcct tgagagcttg ccagaggaag 1260 agcagcagcg ggtcctggga gaggagaaga tgttgaacat caataagaag cagaccacca 1320 gtccagcctc caagaagcct tcacaggaag gtggcaaagg tggttcggag aagcccaagc 1380 ggcctgtgtc tgccatgttc atcttctcag aggagaagag aaggcagctt caggaggaac 1440 gacctgagct ctcagaaagt gagctgaccc gcctgctggc ccgcatgtgg aacgacttgt 1500 ccgagaagaa gaaggctaaa tataaggccc gggaggccgc gctgaaggca cagtccgaga 1560 ggaagcctgg cggagagcgt gaagataggg gcaagttgcc agagtcgccc aagagagctg 1620 aggagatctg gcagcagagt gtcatcggag actatctggc ccgctttaag aatgaccggg 1680 tgaaagcctt gaaagccatg gagatgacgt ggaacaacat ggagaagaag gagaagctca 1740 tgtggattaa gaaggctgca gaagaccaaa aacgatatga gagagagtta agtgagatgc 1800 gggcccctcc agctgctacg aactcttcca agaagatgaa gttccaggga gagcccaaga 1860 aaccgcctat gaacggttac cagaagttct cccaggagct gctgtccaat ggggaactaa 1920 atcacctgcc actcaaggag cgcatggtgg agatcggcag ccgttggcag cgcatctccc 1980 agagccagaa ggagcactat aagaagctgg ctgaggaaca gcagaggcag tacaaggtgc 2040 acttggacct ctgggtcaag agcctatctc cccaggaccg cgcagcatac aaagaataca 2100 tctccaataa acgtaagaac atgactaagc ttcgaggccc aaaccccaag tctagccgga 2160 ccaccctgca gtccaagtcg gagtcagagg aagacgacga tgaggaagat gatgatgatg 2220 atgacgagga ggaagaagaa gatgatgaga acggcgactc gtccgaggac ggtggggatt 2280 cttctgagtc gagcagtgaa gatgaaagcg aggacgggga tgagaatgag gatgatgacg 2340 acgacgagga tgacgacgag gatgacgatg aggatgaaga caacgagtct gagggcagta 2400 gctccagctc ctcatcttca ggggactcct cagactctga ctccaactga gg 2452 127 320 DNA Rattus sp. 127 cagctttgaa ccctgaaact gcttgtggca ggttcagtta ctgacagcac aaattccatt 60 gagtctattc aaggattgaa tgacttcaca agccctggtc tcaggagatt aactttcata 120 ttggggtagt ggttcacttg agaatacaaa atcttcagat taactgatgt ccaaaatact 180 gtcttaaatc atgagatcca tcaattttta gtattgtcta gacctgcaaa tagaattaca 240 ttgtaaaatc tttttcagca tgtgttaaga tgtatgtgaa atttttttgt ttaagtgtaa 300 acttcatacc atactgtcag 320 128 2190 DNA Rattus sp. 128 ctgttagcca gaactaagtc actgcccaca cagcaattac accatgaatc tctaacatca 60 caaccttctt tcaaatactg taaaaaaaaa aatactggaa aaccttttta gatttgtttc 120 aatctaccac gcttctgaaa tgttcagcta ttaaggggaa tgtcaatata tatatacagc 180 ccaaattagc gaagcccaaa ttagggaagc acatagcatg gcatacttga gaggttggca 240 aacaagaggg aagcaggtac tttctgcaaa aacagggact gtgcgccaca atgagaccgc 300 tgtgacctta cttcaaactg tgaaactgtg tgcagcagta ggatttggag tcttaaattc 360 agaaggagca gaaagaaaga gagttagacg tttgaattcc tggtgcttat gctgacgttt 420 tctctcattt gccagcatta aaatctcaag taaggtgtct cctgtgccag gacgtgtcag 480 gtagaccttt ctcagtgagc tggaatcctg ccttctgtgt catagagttg taagagcagc 540 agtttctacc catcatgctg tgtactcact gctgcagagc aagtgggagg ggccagagat 600 gggccgtccc caggggcgga tagcgcggta cacttaattc tcttctgcct gaaagcacac 660 ttccccgtaa gcgaatacag acagaacttt gattgagctg gtatctgacc gataggattg 720 cttctttgat gtacctcttt gacctagcta gcatttcagc tttgatactg cagaacgatc 780 ttttcaaatt ataatcattc tgatagagat atcttaatag acgtgctttt aaaaacaaaa 840 cggaaactac tgtcagtatg aatactgagc cagactggca ttcatagatt taacaacttg 900 tatttctaag attcttaact ctataaaagt aatatggctt ttagatatat aggataataa 960 tttcagtgag accgttatct ctttactcaa cattatgtta gggacagtat aagccaagca 1020 cttacctgtt acacattgga gactaaaacg actgccccca accttagtaa gtatgaaaac 1080 tagactcaca ttatttcatt tttaactgct aaaagtatgt ctatagaatt taaaatttaa 1140 gcactactat ttgtcctggc cacatttttc aaaaattaag ttaaaagtta ttaattatat 1200 acaggtgtgt atttctaata attaaaatac ctttcaaatc catggaatgt ctgcctttta 1260 aatgtaattt gggctttttt gtttgattct ttttcactta actagctgtt tatttgtaaa 1320 catttctttc cacgatttaa aaacacttcc aatcggcttt acttcctgat ttagcacttc 1380 ctattttgac ccttgggcat ctgttcttcg tttaattggt gtagaataat gataaaaatc 1440 ccaagctagt ctaaccaaac tgcattctta agagtactaa gtgggaatga tgcaaagttc 1500 atcaagaatt gaaaagaagc cggtctttgc actaaggaag acagcataat taagtcttct 1560 cgttgccatc tgagtaatat catgatatgg ggaccgcaga atccatctcc ggtgggatga 1620 tggtatagct ttcctagttc cagaaaaaat ctccttcaca ttttatataa taatttaatc 1680 acaatgtcca tcgaattctt ccttaggcat tctttgtaac agtagtgtgc tgctctttaa 1740 aatattaacc tctgacacat gtgtgaacct cattccctgt ggtcactaaa atttcttccc 1800 cactgagcat caatctttag cagattttag gaaaatactg aattcttagt caggaaatat 1860 tttaggaata atcttgctaa cagtatatta agtaataaaa ttacccactc tatgtgtgtt 1920 tcattttttt taatttacat agacaatggt ttttataaag caatgatttc aatttttccc 1980 agctacccaa agtcctggtt aatttgtgaa cacaaatttt gttagccttc aagtttaaac 2040 ggtgctgaat tgtcttaact tgaatgtaaa aatggattaa ggcttatctc atgggacaca 2100 aacatgtccc atatgactgc ttgctgcgta ctaggggaca tgccatttgg tgttaaattg 2160 tctataataa agttcggttt ctcccaaaaa 2190 129 1592 DNA Rattus norvegicus 129 ccaacaccat gcgcgagatc gtgcacatcc aggcgggcca atgcggcaac cagatcggcc 60 ctaagttttg ggaggtgata agcgatgagc atggcatcga cccgacgggc agctaccatg 120 gcgacagtga cttgcagctg gagagaatca atgtgtacta caatgaagct gctggcaaca 180 aatatgtacc tcgggccatc ctagtggacc tggagccagg caccatggac tcagtgaggt 240 cgggaccatt cggccagatc ttcaggccag acaactttgt gttcggtcag agtggtgcag 300 gaaataactg ggcaaagggc cactacacag agggtgccga gctggtggac tctgtcctgg 360 atgtggtcag gaaggagtca gaaagctgtg actgtctcca gggctttcag ctgacccact 420 cattgggggg aggcactggc tcaggcatgg ggaccctgct catcagcaag atcagagaag 480 agtacccaga ccgcatcatg aacaccttca gcgtcatgcc ctcacccaag gtgtcggaca 540 ctgtggtgga gccctataat gccacccttt ccgtgcacca gctggtagag aacacagacg 600 aaacctactg catcgacaac gaggctctgt atgacatctg cttccgcacc ctgaagctga 660 ccacacccac ctatggcgat ctcaaccacc tggtgtcagc caccatgagt ggagtgacca 720 cctgcctgcg cttccctggc cagctgaacg cagacctgcg caagctggct gtgaacatgg 780 tgcctttccc acgcctgcac ttcttcatgc caggcttcgc acctctgacc agcaggggca 840 gccagcagta ccgagccctg acagtgcccg agctcaccca gcagatgttc gactccaaga 900 acatgatggc tgcctgcgac ccacgccatg gccgctacct gaccgtagcc gccattttcc 960 ggggccgcat gtccatgaag gaggtggatg agcagatgct caacgtgcag aacaagaaca 1020 gcagctactt cgtggaatgg atccccaaca atgtgaagac ggccgtgtgt gacatccctc 1080 ctcgtggcct caagatgtcc gccaccttca ttggcaacag caccgccatc caagagctgt 1140 tcaagcgcat ctcggagcag ttcactgcca tgttccggcg caaggccttc ctgcactggt 1200 acacgggcga gggcatggac gagatggagt tcaccgaggc ggagagcaac atgaatgagc 1260 tggtgtctga gtaccagcag taccaggatg ccacggctga tgagcagggc gagttcgagg 1320 aggaggaggg tgaggatgag gcttgagttc ccaggccaag caggttaggg aaagctgagg 1380 cgaaaggagg gggtgggggt cttaatctgt gaaaatacct tggcagttgg aagaaggaga 1440 atggtcttag gtttgtgctg ggtctctggt gctcttactg ttgcctctca cttttttctc 1500 tttttgtaat atcgatgacg tgatgtgatg cttgagatct ttctgaactc ctgttgtgat 1560 ggctgaaatc gcctgaacct ttgtgtccta aa 1592 130 3043 DNA Rattus sp. 130 ggaagacaga agcgtgggag tggagaaagt ggcacaaacc agagccctgg agatactctg 60 gaaacgcgga gcagattgtg ttggaaagct ggcaacagcg gtggtctcgg gcgcggcggg 120 agggcaggct gggtcggggg cgggcagtgt gtcgtcagga gctgggcggc cccgcggcgg 180 ccgcgtcttc ctagcccatc ccctcaagcc cagctcgggc tccggcgccc cagctccgac 240 cgtgcgctct cctgggtcaa gttagagccc accgagagcc ccgcgccact cgcgcactct 300 cgcacccggg cagaaggact gcggaagggg ccgccccacg ggctgggtga gggctgcagc 360 gtccagagaa ggaagcgttg acagctggag ccggctgctg gaagggtttg cgccgggacg 420 cgggggttgc cgtagcgcta cgcaagacgg ggcttaccct ctgaaaagca aacataagag 480 gctgtacaac caggttatct ctgcgatcag cctcaagcag agaagaaaga tggccaaaga 540 caagggcctc atcagcccag aagactttgc ccagctgcaa aagtacatag actactccac 600 caaaagtgtc agcgatgtgc tgaaggtctt tgagatgaac aaatattgcc aaggagatga 660 gattgggtac ctgggatttg aacagttcct gaaaatgtat ctggaagtgg aggaggttcc 720 ccatcaccta tgctggactc tgttttggtc cttccatagc agtcaagact tggatgagga 780 gactgagtca aaagccaatg tgatctgtct cagtgacgtc tactgctact tcaccctcct 840 ggaaggcggc agtccggaag acaagctaga gttcaccttc aagctgtacg acatggacag 900 aaatgggatc ctagacagca cagaagtaga aaaaatcatc ctgcaaatga tgagagtggc 960 tgaatatctg gactgggatg tgtctgagct gagaccgatc cttcaggaga tgatgaaaga 1020 gatggaccgg gatggcagtg gctgtgtctc cctagctgag tgggtccggg ctggggctac 1080 cactgtgccc ctgcttgtcc ttctggggat agacatgact atgaaagatg atggacacca 1140 tatatggaga cccaagagat tctccagacc ggtctactgc aacctgtgcg agttgagcat 1200 tggcctcggc aaacaaggcc tgagctgtaa cctctgtaag tacattgttc atgaccactg 1260 tgccatgaag gcccagcctt gtgaagtcag cacctatgcc aagtctcgga aagacattgg 1320 tgtccagcca cacgtatggg ttcgaggagg ctgtcattct ggacgctgcg accgctgtca 1380 gaaaaagatc cggacctacc acagcctaac gggactgcac tgcgtgtggt gccacctgga 1440 gatccacgat gactgtctgc aggctgttgg tcccgagtgt gactgtggac tgctccgtga 1500 tcatatcctg cctccgtgtt ctatctaccc cagggtcctg gtatctggac aggagtgcaa 1560 acagaagacc acagatgtta cgagcctgtg cacccctgag gcttttcgga ttgaacctgt 1620 ttctaacacc cacccccttc tggtcttcat caatcctaag agcggaggca agcaggggca 1680 gagcgtgctt tggaagttcc agtacattct gaaccctcgg caggtgttta acctgaagga 1740 tggtccggag ccagggctca ggtttttcaa agacgttcct cagttccggg tgttggtgtg 1800 tggtggagac ggcaccgtag gctggattct agagaccatt gacaaagcca actttcccat 1860 tgtgcctcca gtcgctgtgt tgcccctggg cactggaaat gacctggctc ggtgcctaag 1920 atggggaaga ggttatgaag gtgagaactt gagaaagatt ctcaaggata tagagataag 1980 taaggtggta tatctcgatc gatggctcct ggaagtgata ccccaacaaa acggagaaaa 2040 gagtgatcca gttccctctc aaatcatcaa taactacttc tccattggtg tggatgcttc 2100 cattgctcac cggttccatc tcatgagaga gaaataccct gagaagttca atagcagaat 2160 gaagaacaag ctttggtact tggagtttgc cacatcggag tccatcttct caacgtgcaa 2220 aaagctggaa gagtcagtga ccgtcgagat atgtgggaag ctgctggatc tgagtgacct 2280 gtccctcgaa ggcattgcgg tattgaatat cccgagcatg catggtggct ccaatctctg 2340 gggtgacacc aagagacctc acggggatac gtgtgggatc aaccaggcac tgggcagtgt 2400 ggccaaaata atcacagacc ccgatattct caaaacctgt gtgccagaca tgagtgacaa 2460 gcggctggaa gtcgtaggaa tagagggtgt gattgagatg ggtcagatct ataccaggct 2520 caagagtgct ggacaccggc tggccaagtg ctccgagatc acgttccaga ccacaaaaac 2580 cctccctatg caagttgatg gagaaccctg gatgcaagca ccctgtacaa tcaagatcac 2640 ccataagaac cagatgccta tgctaatggg tccggctccc agttcctcca atttctttgg 2700 cttttggagc tgaggatgga tgccatctgc cttgagccca cctccctgtt cctggagatt 2760 tcccactatc tagatgctgc cacaccttcc tgccagccca gaaggatgtt ccatcacctt 2820 cacagtattt attatcctcc gccacctcac tgctcccaca cagacatcct tacacaacca 2880 gcgatgcgta accttgaaaa tgcctcatct aataaagtga ctttttccat cactgggata 2940 tctgttaaaa tgagcgacac atctcttttt acaccttcac ccctctacag acagatttaa 3000 aagcagacag aacaaataaa tgaacaaaga aagcctaaaa aaa 3043 131 383 DNA Rattus sp. 131 ggagttaaaa ggtgtttaat gaggggaaga atattgaaca tatacttgtc ttattcccca 60 cccccattca gatgaatcct gagccatgga aagattagcc attcaatgag actttgggta 120 atgttgagtg gctcgagaac tcttgaggat gaggatgaga gacaactttt taggtggtgc 180 aaggatggag gattaagact ggaggtcaca gcatgtgtag cacaaggcaa tgggactcat 240 aagcaggatg gtaagcagcc tcagcatctg gtgatcagat cttcaggctt ggctatgggc 300 aatattactt ctgctttgtc ttctgctggt atggtgatgg agtggcagtt ggggggcagg 360 gctcatgaac aacaggctgg cat 383 132 672 DNA Rattus sp. misc_feature (1)..(672) where n may be a or g or c or t/u, unknown, or other 132 caactaccag gattgcattt attataacag actgaaagtg caagccgaga gagaggtctg 60 agaagcagcg taaacacagg gagcactcct gaggaagctc agatgcactc cttcccaggg 120 actgcccctg aagaacaccc agccctcgga attaaggcag ttaggatgtg ggcactgttg 180 tgccaatatt ggtggtgttt tcacccagcc cctaccagct gtccctttct atccctccta 240 actggacatt attgagcttt ctccacgagt aatgctttcc cacacaaggc cacccagctc 300 aagaactttc cttccaggga cggttccagc caagtacctc aatgttaact attaactaca 360 gcaacaacca cccagagact gtggtttggg ggtactgccc acccccagga gctgccaaat 420 gtccaggcta ctgtgttcta accaaataga aacagagctc tacacttcag ttccacaacc 480 acttctggcc ctcactgagc cctgccaagt ccttactctg ccctacatgt attccctttt 540 cacacgaggc ctccaccctg cagacttaca gaaggccggg atatggtttg tgctccttcn 600 ctgcgggcct tacataaagt gctcagaatc agagattctt gcactgaatt gcagactccc 660 tcatgccgaa tt 672 133 367 DNA Rattus sp. misc_feature (1)..(367) where n may be a or g or c or t/u, unknown, or other 133 tttgaactct gaattagaca gtttgtattc taaaacacac acaaatctgt accaattaaa 60 tacaaagtcg aaagggaata taaattacag ttttngtttt caacggatga tagtaaaaca 120 ctaccgtcag ctcaccacac acacaaacag cagcttttca atagagtctc ccaggcctcc 180 ctcaccaatg ctaaggactt aaaaaataaa aagtacagca ggctgcacat gctctctcac 240 ttcctcacaa tacacggata gctgtacact gaggtccatt gtacaggcat ctacagagta 300 agtacataaa atatatttgt aaaattcttc cacgcacgat cctacaatgt ctctcctcgc 360 aggatac 367 134 2225 DNA Rattus norvegicus 134 tttcagggat ttttgcgatt cctctctgta gacttctact tgttctctaa gggagttctt 60 catgtctttc ttgaagtcat ccagcatcat gatcaaatat gattttgaaa ctagatcttg 120 cttttctggt gtgtttggat attccatgtt tgttttggtg ggagaattgg gctccgatga 180 tggcatgtag tcttggtttc tgttgcttgg tttcctgcgc ttgcctctcg ccatcagatt 240 atctctagtg ttactttgtt ctgctatttc tgacagtggc tagactgtcc tataagcctg 300 tgtgtcagga gtgctgtaga ccttttttcc tctctttcag tcagttatgg gacagagtgt 360 tctgcttttg ggcgtgtagt ttttcctctc tacaggtctt cagctgttcc tgtgggcctg 420 tgtcttgagt tcaccaggca gctttcttgc agcagaaaat ttggtcatac ctgtgatcct 480 gaggctcaag ttcgctcgtg gggtgctgtc caggggctct ctgcagcggg cacaaccagg 540 aagacctgtg cggccccttc cggagcttca gtgcaccagg gttccagatg gcctttggcg 600 ttttcctctg gcgtccgaga tgtatgtaca gagagcagtc tcttctggtt tcccaggctt 660 gtctgcctct ctgaaggttc agctctccct cccacgggat ttgggtgcag agaactgttt 720 atccggtctg tttctttcag gttccggtgg tgtctcaggc aggtgtcgtt cctgcgccct 780 cccccatggg accagaggcc ttatacagtt tcctcttggg ccagggatgt gggcaggggt 840 gagcagtgtt ggtggtctct tccgtctgca gcctcaggag tgccacctga ccaggcggtt 900 gggtctctct ctgagaattt catttttaaa tcattcatta aaatgtcatg acttgatgtc 960 ctgctgtccg tctcacgccc tcagctgtaa cagtgccgag ggagtcactg aagaagagac 1020 tgaatgacca gagtatgggc agcacagaca actcaacaaa aatgtcttca gaggtggaga 1080 ctgcggaggc cgtagatgag tcagagaaga actctatggc atcagagaag gaaaaccatt 1140 ccaaaatagc agacttttct gatcttctga aggaagggac aaaggaagca gatgaccggg 1200 cagaaaatac ccagtttgtc aaagacttct tgaaaggaaa cattaagaag gagctattta 1260 agctggccac cactgcactt tcatactcag cccctgagga ggaaatggat tcactgacca 1320 aggacatgga gtacttcttt ggtgaaaact gggaggaaaa agtgaagtgc tctgaagctg 1380 cccagacgta tgtggatcag attcactatg tagggcaaaa tgagccagag catctggtgg 1440 cccatactta ctctacttac atggggggaa acctttcagg ggaccaggta ctgaagaagg 1500 agacccagcc ggtccccttc actagggaag ggactcagtt ctacctgttt gagcatgtag 1560 acaatgctaa gcaattcaag ctattctact gcgctagatt gaatgccttg gacctgaatt 1620 tgaagaccaa agagaggatt gtggaggaag ccaccaaagc ctttgaatat aatatgcaga 1680 tattcagtga actggaccag gcaggctcca taccagtaag agaaacccta aagaatgggc 1740 tctcaatact tgatgggaag ggaggtgtat gcaaatgtcc ctttaatgct gctcagccag 1800 acaaaggtac cctgggaggc agcaactgcc ctttccagat gtccatggcc ttgctgagga 1860 agcctaactt gcagctcatt ctagttgcca gtatggcctt ggtagctgga cttttagcct 1920 ggtactacat gtgaagggcc tgtcaagttg tttgcatcct atctcaacat cctaccactt 1980 gttccttccc cacctccacc tctgcctaga actaccacct caggtgacat ttttaatgtt 2040 gggtttgaga aaatgagcaa ccaataaaag acagacccta gaaaaaagtc atgacttaag 2100 tggcacgggg acacctaaag tcacactttg tgcttcagac atactttctt tctctatttc 2160 aacactgaat tcgggaagta acctactact attaataata aatgctacac aatgcataat 2220 aaaaa 2225 135 467 DNA Rattus sp. 135 caaggtcaca cgtgatttaa tgtaggtggc aatgaaactg ggtttggtga gctacatcgt 60 taaaacggaa ggcagccttc ctagaatctc aatgcattgg cttaggagta agcaatactg 120 aaaaaagtta aagcatctgt tggcctcttt cccatcacag ggtacaacaa cctcttgtag 180 tcagcttcct ctacagtagg ctctaaagag ttcagcatga accaagatgg gaataatttg 240 gtgaccaaac tgcattttcc aaagtccttt tgtggcagaa tctagtaact tattgagctt 300 caggatggat ctagtttcct tttcttcagg tcctggctct tcctatgaga ttcagagttt 360 tctctgctac ctaggacatg ctgattagtg ctgcaggtct gcaggtgccc aggtccaggt 420 ttttcttgta ctttctggtg gcaatggata tgtgggacac tgtcctg 467 136 822 DNA Rattus norvegicus 136 ggctgtggag gcaatggcct ggtgctgtcg gctacggaca gcagggcttg ccggaaggta 60 gccgtgaaga agattgtgct gagtgacgca cgaagcatga agcacgcgct ccgagagatc 120 aaaatcatcc ggcgcctgga ccacgacaac atcgtcaaag tgtacgaggt actgggaccc 180 aagggtagcg atctgcaggg cgagctcttt aagttcagcg tggcttatat tgtccaagag 240 tacatggaga ctgacctggc atgcctgcta gagcagggca cgctgaccga ggagcacgct 300 aagctattca tgtaccagct gctgcgtggg ctcaagtaca tccactctgc caacgtgttg 360 cacagggacc tgaagcccgc caacattttc atcagcacgg aggacctcgt gctgaagatc 420 ggggatttcg ggctggccag aatcgcggac cagcattact cccacaaggg ttatctgtcg 480 gaagggttgg tgacaaagtg gtaccgctct ccacgactgc tcctgtcccc gaacaactac 540 acgaaagcca tcgatatgtg ggcagctggc tgcatcctag cggagatgct cacggggaaa 600 atgctctttg ctggggctca cgagcttgag cagatgcagc tcatcctaga caccatccct 660 gtagtgcggg aggaagacaa ggaggagctg ctcagggtga tgccgtcctt tgtcagcagc 720 acttgggagg tgaagaggcc actgcgtaag ctactcccgg atgtcaaccg tgaagccatt 780 gactttctgg agaagatcct gacgttcagc cccatggacc gg 822 137 512 DNA Rattus norvegicus 137 ttgtcttaac acttttttat tgattcattt gtgtttaccc atgtttcccc tcatttaaaa 60 aaaaaaatca gtgtttctaa tttcttggat tttcaacttt aagttccacg ttgacaaatc 120 tatttttatg tacacctttg attcttacat tacaacagaa gtcacatgtt taaaaatacc 180 cactccttaa aacatagtca ttgtcaatat tttgtttaca gtacagatcc tgaagacaaa 240 cacttgacta tattaagatt tattagtctg aaaacagaat gtaggatatt ttgtatatac 300 cgtcggtata tactgaattc tggcagcaaa gtgtttatga gaaaatttca attttccctg 360 gtagtataat ataaaggcca atctacttct cacttgttac aactatattt aactgaatta 420 tggtttaaaa tcacaaagtg ataaaatata aatacaaaat actaagaaaa gtttaacata 480 taaagagaaa gtgtcttatg tattgtaatt tt 512 138 451 DNA Rattus norvegicus misc_feature (1)..(451) where n may be a or g or c or t/u, unknown, or other 138 tttttttttt tttttttata ttaaatgact attttattta cacaccctat catatccata 60 aataattgcg ccctctttgt ctgcacaatg gttccttgac atctaacgat tacagagtga 120 acatctaaaa gtagatgggg gcttactccg gcttaggaca tggtctcttg tgtacactca 180 atatgcacac ccagtaacac agacatagac cagagagggt ggctcccagg gcctcctggg 240 ctcccagcac caatgggaaa accttgccca tacacaggtg agttcctgac aactggagaa 300 cagagagatg caggccaggg ctccgcccca cccaccgcag ggccccgccc acctctggag 360 cagctccacc tttgttcacc ctggaaggca tctctttgga ttgcaaatac tttaattcac 420 agaggggatg gggtangggc tggggtggtg t 451 139 3208 DNA Rattus sp. misc_feature (1)..(3208) where n may be a or g or c or t/u, unknown, or other 139 gaattccgcc taagaatagc aaagtagtaa agagagattt ggatgatgat gtcattgaat 60 ctgtcaaaga cctcctgtcc aatgaagact cagtggaaga tgtttctaag aagagcgaac 120 taattgttga tgttcaagaa gagaaagata cagatgctga agatggatct gaagttgatg 180 atgaaaggcc agcttggaac agtaagctgc aatacatcct ggcgcaagtt ggattttctg 240 taggtctggg gaatgtgtgg cggttcccat acctctgcca aaagaatggc ggtggtgcat 300 atcttttgcc atatttaata ctacttctgg tgataggaat tccactcttc ttcttggaac 360 tttctgtggg tcaaagaatt cggagaggca gcatcggggt ttggaattac ataagcccta 420 aactgggtgg gattggattc gcaagctgtg tagtgtgcta ttttgtggct ctctactaca 480 atgtcatcat tggctggaca ttgttttact tttctcagtc ttttcaacaa cctctccctt 540 gggatcaatg ccccctggtg aaaaacgcat ctcatactta tattgagcca gaatgtgaaa 600 aaagttctgc caccacttat tactggtacc gggaagcact ggctatcagc agctccatct 660 ctgaaagtgg gggcttaaac tggaagatga ctggctgcct gctggctgcc tgggtcatgg 720 tttgtctggc catgatcaaa ggcattcagt cctctggaaa aatcatgtat tttagttctc 780 tgttccccta cgtggtactt atatgcttcc taataagatc tctcctttta aatggttcaa 840 ttgatggcat ccgacacatg ttcaccccta agcttgaaat gatgctggag cccaaggttt 900 ggagagaggc agcgactcaa gtcttctttg ccttgggttt gggatttggt ggagtcatcg 960 cgttttccag ttacaacaag agagacaaca actgccactt cgacgctgtc ctcgtgtctt 1020 ttatcaactt tttcacttca gtcctggcaa cactggtggt gttcgcagtt ctggggttca 1080 aagccaatat cgtaaatgaa aagtgcattt cacaaaattc tgagatgatc ttgaaacttt 1140 tgaaaacggg aaacgttagt tgggatgtca ttccccgtca catcaacctt tcagctgtca 1200 ctgcggaaga ttatcatgta gtttacgaca tcattcaaaa agtgaaggag gaggagtttg 1260 ctgttctcca tctcaaagcc tgtcaaattg aggatgagct aaataaagct gtgcagggca 1320 ctggcttggc tttcattgcc tttacagagg ccatgacaca ttttcctgca tctcccttct 1380 ggtcagtgat gtttttcctc atgctgataa atctcgggct cggcagcatg tttggaacca 1440 ttgaaggaat catcactcct gttgtggaca cattcaaagt gaggaaggaa atactcaccg 1500 ttatctgttg cctcctagca ttttgtatcg gcttgatgtt tgtgcagcgc tctggaaatt 1560 acttcgttac aatgtttgat gattattctg ccacattgcc tctgttaatc gtggtcatct 1620 tggagaatat tgctgtaagc ttcgtttacg gcatagataa gtttctagaa gacctaacag 1680 acatgttagg atttgctcca agcaaatatt actattacat gtggaaatac atttctcctc 1740 taatgctagt aacattgtta atagctagca ttgtgaatat gggattaagt cctccgggat 1800 ataatgcatg gatcaaggag aaggcatccg aagaatttct gagctacccg atgtggggga 1860 tggtggtctg tttctctctg atggtgctgg ctatacttcc tgtcccagtc gttttcgtca 1920 ttcgtcgctg caacctcata gatgatagtt ctggtaactt ggcctctgtg acctataaga 1980 gaggaagagt cctgaaagaa cccgtgaact tagacggaga tgatgcaagc ctcattcacg 2040 gaaagatacc aagtgaaatg tcgtctccaa attttggtaa aaatatctat cgaaaacaga 2100 gtggttctcc gaccctggac actgctccca atggacgcta tgggattggg tatttgatgg 2160 cagatatgcc agatatgccc gagtctgact tgtagctgag tgaaagcaaa gggttgagct 2220 tggttcattt ttatcaatga gcattggttc tactatgaga agcagtgagc ttcacttgtc 2280 acaaggtgat ctcaggtgtc catagccgcc atctttaatc tcaacagttt aagacagttc 2340 cagaagagca atcctcagtt tacaattaca aagtaacaat tgcagacaaa gcttacattg 2400 actggggtcc tttgccagga tttttttaaa aagcactttg acatactttc aagtatttct 2460 atctcttaaa aaaaaggtgt tacctcagtt tctaatagtt tccggattta atattattgg 2520 cgatttgaaa aaaaatccct gttatatctt acaattcata attttgcctt cggagtaagt 2580 tccagtatta ccatgaacag ttgtgtgtga gcgggtgctt ctcagcacat tgccatgagt 2640 acgttctgta gatagcctgt acttatcttt ggtagcattg aaaccttagg cacttagttg 2700 gagaaaactt caaagtattt tcttatatga tagccgtcta gagcaatagt attaaagagt 2760 aaaaaggcac tgatggtgga tgagaggtta agtccagatg tgatcgaggg tttcctggag 2820 tggccatata ttttgtgtaa aataggtgtg tgcaaatgat tgaagggact ctcgagaatt 2880 atgcagactg catttttctt atgccgtgtg cctaataaac ctacttaata tttattgtgg 2940 ttttgagatc acttatagta tatttatata atatacttgc aatgtataga gatgcgcatc 3000 aggactctta agtgctggtt tgaaaacttg aagcaagata gcatctgatt tcatatgttt 3060 ctgttttgct tcattttatg caaatacgaa ttcttttttt taagtgattg ttaaaactgt 3120 atggcattac attttaacct acaaataaac gaagtttanc aaaaaaaaaa aaaaaaaaaa 3180 aaaaaaaaaa aaaaaaaaaa aggaattc 3208 140 436 DNA Rattus norvegicus 140 tttttttttt tttttttaca accattcata gaaattttat tggacaaagc aaaatgctct 60 tcaataagca aacaaccata aaaacatggt gcgtccgcac cactcagcag ggaagtcatg 120 caatttctgg aacagagcac aaggtattac actgagtgag agcatctcca atgcttctgt 180 caacacgctt ccatctcagc gctgcagagc tggaggagag ttggtatggt agcgaggtat 240 gaagagttag gtggtagtga ggcagctggg tgactataag acagaatcaa cgggccctct 300 ggagtactga gctcgttcta tacattggtc atacataatt gcatgaatcg gcacacatga 360 taaaatagca cgaagccaca tacatactgt tccaattaac aagtccttgc ttttgatatg 420 ttacaccata tgttct 436 141 636 DNA Rattus sp. misc_feature (1)..(636) where n may be a or g or c or t/u, unknown, or other 141 agcattttca aactttattt acaactgtca cagtgacaaa aagtagtttg gaaaaaaaaa 60 atgctagttt ctccctgagc ctcgatacag aacagacaga agtcacagga ggttcatctc 120 acaacaggca tgtcactgaa atactaggat tttttttttc aatacgatca gttagaaata 180 cacacaaatt acttaaagaa agaagaggag gaagagggga agagaaagag aaagagaaag 240 agaaagagaa gaagaggcca gacaggagct cagccacttg tccaagagca gctgggtccc 300 cccaacaggc tcaaccgctg aggggcctga cgttagctat cagcccctga cctgctcaga 360 caacacacgg ttgtacaaca tggtctagtg accggcaaaa ggaagaagcc accccacaga 420 cacacatata cacaaagctg attgatactg gatttacaag cacatcccgt cacacggcaa 480 gaccaagaca gatcggggta ggggtgagaa gacaccaana cacaggaatt tcaaaggcca 540 aacacctgtc cataaaggga ggcgagggac gagaaatgcc gtgggagaag gggagaagga 600 tgaacagatc tctgtagcca aaacagaaca gagtgg 636 142 437 DNA Rattus sp. 142 acaaaaataa atgtagtctt tattaccaag taataaaata gaagccataa ttaactatag 60 cgtagggcag caggatgagg tgattaaatg aataatgtta cattgtcttg gggggaaact 120 aggttttcag aattacagtg tctggaattt tagtgcttaa aaaaaaaaaa catattttag 180 gaggaataag gaactggtag aacaaggaaa tggcttaact aatctgagtt aagagcacct 240 ctgagggcca catggtctgt aatcacagct ccagaggctg aaaaacagga gatctacagt 300 ttaaggtcag ccatggctag gctgaatctg ttgtctgttt aaagagcacc aagaaattcc 360 ctggtcaaac gagagcttct ggtgaaagac taaaggaatg gcagtaagga catagaagtg 420 gctccttgcc tcgtgcc 437 143 592 DNA Rattus sp. 143 cccgaagaca accaaagctt ccagtacgat catgaggcct tcctaggcaa ggaggattcc 60 aagaccttcg atcagctaag cccggacgag agcaaggaga ggctggggaa aattgttgat 120 cgaatcgaca gtgatggaga cggccttgtt actactgagg agctgaaagt ttggatcaaa 180 cgggtacaga aaagatacat ctacgataat gtggctaaag tctggaagga ttatgatagg 240 gacaaagacg aaaagatctc ctgggaagaa tacaagcagg ccacctatgg ctactacctg 300 ggaaaccctg ctgaattcca agatagctct gatcatcaca cctttaaaaa gatgctgcca 360 cgggatgaga ggaggtttaa ggcttcagac ctcgatggcg acctgacagc tactcgggag 420 gagttcactg cctttctgca cccagaggag tttgaacata tgaaggagat tgtagttctg 480 gaaaccctgg aggatatcga caagaacggg gatggttttg tggaccagga tgagtacatt 540 gcggacatgt tttctcacga ggacaatggc cctgagccag actgggtttt gt 592 144 3027 DNA Rattus norvegicus 144 ggcagtagct ggatgagggc gttgcttgcc tcctcccttt tttttctccg attggttctg 60 aggggtatat tcgagttgca aaatggcggc cccgagcgct ctcttcagcg ttcagtagca 120 gcttcaggct gagcggatgt ctcttctcct cagtttcgga ctcagagaca cgcggctccc 180 tactcctgct gatcacgaag tccttgaagg cgctcaacgc accggaatct cccagcggcc 240 gcgaccgccg cctcggccct gctctccgcg gcgccggaac tcagcgtgat ctgcggcggc 300 cgtctaggag gttcacaaaa atggcgaaga gagttgcgga gaaggaatta acagatagga 360 attgggatga agaagatgaa gttgaagaga tgggaacatt ctcagtggcc agtgaggaag 420 tcatgaagaa cagagccgtg aagaaggcaa agcgtaggaa tattggcttt gaatctgata 480 gcggaggagc ctttaaaggt ttcaaagggt tggttgtgcc ttctggagga ggagggtttt 540 ctggatttgg tggtggctct ggagggaagc ctctggaagg actgacaaat ggaaacagca 600 cagacagtgc cacgcccttc tccagtgcaa agacagcagc ggagcctaag gcagcctttg 660 gttcttttgc tgtgaatggc cctactactt tggtggataa aaagatctcc agtcctaaat 720 gcaatagcag caatcagccg ccctcctccg gcccagcctc cagtacctcc tgcactggga 780 atacctatca taagcagttg gctggcttga actgctctgt ccgggattgg atagtgaagc 840 atgttaacac aaacccactt tgtgacctga cgcccatttt taaagactac gagagatact 900 tagcgacgat cgagaagcag ctggagaatg ggggtagcag cagctcagag agacagacag 960 acagggcgac ggctgcaatg gagcctcctt ccctttttgg ttcaacaaaa ctacagcaag 1020 attcaccatt ttcatttcat ggcaacaaag cggaggacac atctgaaaag ttggagttta 1080 cagcagaaaa gaaatcggac gcagcacaag gagcaacaag tgcctcgttt aattttggca 1140 agaaaattga gagctcagtt ttgggctctt taagctctgg ctccctaact gggttttcat 1200 tctctcctgg aaactctagt ttatttggta aagatgctgc ccagagtaaa gcagcctctt 1260 caccgttttc tgctaaagca tccgagagtc aagcaggagg cagcagcagt gagtgcagag 1320 atggtgaaga agaggagagt gatgagccac ccaaggtggt ggtgactgaa gtgaaggaag 1380 aggatgcttt ctactccaaa aaatgtaaac tattttacaa gaaagacaat gaatttaaag 1440 agaagggtgt ggggaccctg catttaaaac ccacagcaac tcagaagacc cagctcttgg 1500 tgcgggcaga caccaaccta ggcaacattc tgctgaatgt tctgatccca cccaatatgc 1560 cgtgcacccg gacagggaag aacaatgtcc ttatcgtctg tgtccccaac cccccacttg 1620 atgagaagca gcccactctc ccggtcacca tgctgattcg ggtgaagacg agcgaggatg 1680 ctgatgagtt gcacaagatt ttactgcaga aaaaggacgt ctgagcactg aggctgacca 1740 gggcacgtca ccatgttgct gcttcccttt gcccctaaac ttagtcacat tctttcctct 1800 ttgtactgtg acattctgag aacttctagg taacttgaaa cttttgtgag gaagattaag 1860 gccaataaat cctttcagtg tgtcgaagct gttctccctt cctaagaact aagcaaaata 1920 cattggagtg aaaagtttgg gaagattttt taatgtcgat tcattgagta aactaaccta 1980 agtgattctt acggactgta atcagggtac cagttagctc tccaaaggct ccctcaggca 2040 gccacgggtg ccactctctt cctgccctgg gagactcaat ggcagtgtcc acagagttcc 2100 agaagacgcc tgctcccctc ctgtgggctt gtttggtccg tgactagcac tcctgccaaa 2160 taccacaccg gcacactgta accgcgcttg ttctgtttcc ttctgagcgg aactgtgggt 2220 cctggggatc tctgtcctta gcctgttttt gacaggtgct ggcctttgac ctggaactgc 2280 ttgactgaac caggcactgc ctttccatgg gaagagaggg caggtagtgg cttgtcgggg 2340 agctggcgag gtatagactg ggttttgtcg tttatccatg aggtgctctt acttgcttac 2400 ctccctagtt aacatggatg ggggctgtca ggaataatgg attttattaa aagcacaaat 2460 ttggtagcat tttttaaata tcatttctat accacaaaca agcctttatt ttaaaagaaa 2520 aaagtgaaat tgtgctgaaa agggttgtat gctcgtggcc gtttgtgccc cggggacctg 2580 gtggtcacag tctgaatgga gccctgtcag agggtgctcg tgagaaggaa gagtgtggga 2640 gtggagacgg ctcattcggc ctgtgacttc agacagcagc atgttattca gcgtctgaaa 2700 agvccaattt catttttacc tttttacgga tgaggtattc ttgagtctgc ccacttggaa 2760 gagagccatg gttctacacg ccattcatgc cacttgagag cggtgaggtt accgatacta 2820 gcattctctc aaggccagac atcagttaaa tgcaaggttt gttgacgagc tggtcacggc 2880 tccctcctaa ccccagtgcc ggaaagctga ggcaggaaga ccgtgtgagt tcacaacagc 2940 ctgagctacg cagtaggaac ccctccccct aaaataaaat aaaagtttgg tgtctaaaaa 3000 aaaaaaaaaa aaaaaaaaaa aaaaaaa 3027 145 535 DNA Rattus norvegicus 145 cggccgcctt cctggcccag caggagagcg agattgctgg catcgagaat gactcgggtt 60 tcggggcacc tgccgccagc caggtggcct ctgcgcagcc cggactcgcg agcgggggtg 120 gttcggagga catggggact acagtcaatg gagatgtgtt tcaggaggct aacgggcctg 180 ccgatggcta cgctgcgatt gcccaggcgg acaggttgac tcaggagcct gagagcatcc 240 gcaagtggag agaggagcag aagaaaaggc tgcaggagtt ggatgctgcc tcgaaggtga 300 ccgaacagga gtggcgggag aaggccaaaa aagacctgga ggagtggaac cagcgccaaa 360 gtgaacaggt tgagaagaac aagatcaaca acagggcatc ggaagaggct tttgtgaaag 420 aatccaagga ggagacccca ggcacagagt gggagaaggt ggcccagctg tgtgacttca 480 accctaagag cagcaagcaa tgtaaagacg tgtcccgcct gcgctcggtg ctcat 535 146 2046 DNA Rattus norvegicus 146 cgccgcggcg gccgcagagg cggaggccga ggccgaggcg caggggggcg cgccccgggc 60 ccaggcccgg ccccagctgc cgctgcggag cccgccggga ggccccggag cgcggccaca 120 gcgcagctgc tgccatggcg cagaccctgc agatggagat tccaaacttt ggcaacagca 180 tcctcgagtg cctcaatgag cagcggctac agggactgta ttgtgacgtg tcagtggtgg 240 taaagggcca tgccttcaaa gcccaccgtg ctgtgttggc cgccagcagc tcctacttcc 300 gggacctatt caacagcagc cgcagtgctg tggtagaact gccagccgct gtgcagccac 360 agtcattcca gcagatcctc acgttttgtt atacaggccg gctgagcatg aacatggggg 420 accagttcct gctcatctac acagccggct tcctgcagat ccaggagatc atggagaaag 480 gcactgagtt cttcctcaaa gttagctctc caagttgcga ctcccagggc ctgcacccgg 540 aggaggcccc atcctcagag cctcagagtc ctgtagcgca gatattgggc tggccagcct 600 gtagcacgcc actgcccctt gtgtcacggg tcaagacaga acaggagttg gactcggtgc 660 aatgcacacc catggccaag aggctatggg atagcagcca gaaggaagct ggaggcagtg 720 gtggcaacaa tggcagccgc aagatggcca agttctccac gccagacctg gcccctaacc 780 ggatgcccca gccagtctct gtggccacag ctacagcagc agtggctgtg gttgcagtgg 840 ggggatgtgt gagtgggccc agcatgtcag agcggaccag cccaggtacc tccagtgctt 900 acactagtga cagccccagc tcctaccaca acgaagaaga cgaagaggaa gatgcaggtg 960 aggagggcac agatgagcag taccgtcaga tctgcaatat gtataccatg tacagtatgt 1020 tgaacgttgg ccagacagtt gagaaggtgg aggctcttcc tgagcaggtc gtccttgagt 1080 cccacagtcg cattcgagtg cggcaagacc tggcatctct cccagctgag ctcatcaacc 1140 agatcggcaa tcgctgccac ccaaagctct acgatgaagg cgacccctca gagaagctgg 1200 agcttgtgac aggcaccaat gtatacatca caagggcaca actcatgaac tgccacgtca 1260 gtgcaggcac gcggcacaag gtcttgctgc ggcggctcct ggcttccttc tttgaccgga 1320 acacactggc caatagctgt ggcaccggca tccgttcttc caccaatgac cctagacgca 1380 agccactgga cagtcgtgtc ctccatgctg tcaagtacta ctgccagaac ttcgccccca 1440 acttcaagga gagcgagatg aatgccattg cagccgacat gtgcaccaat gcccgccgag 1500 tggtccgtaa aagctggctg cccaagacca agccgctaca cctggtggag ggcgataact 1560 acagcagctt catcagcgac actggcaaga tagaaccgga catgatgagc atggaacaca 1620 gcttcgagac agccagccac gatggcgagg ctggcccttc agctgaggtt ctccagtaac 1680 atacatgtga caccccctta ccggatgtca cattccccct cctatcacac ccccacctac 1740 cacctacctg gtcacgatct actgtctgtc cctccccaga acctgtgagg gggtgttcag 1800 tgccctctat cggatgcaag agctggctga ccaaggccaa agcactgtac ctagcagagg 1860 gcagtgccga cagcagcttt ctcaatgacc atgaggaaga gctgaacttg gtaggcatag 1920 aatacagctt ccacacagac agccccctca gcagaggtcc tccagtacct gcccaggacc 1980 ctcccatgga tgtcacactc ccctcctgtc acacacatac ccccacctta gtcacgagct 2040 actgtg 2046 147 312 DNA Rattus norvegicus 147 gtctcacgtc ctctctgcac tctggaccct gacttcaccg acatgaaaac tcattacttt 60 ctcctggtga tgttattttt tctcttctcc cagatggagc tgggtgctgg cattctcaca 120 agtcttggac gcagaacaga tcaataccga tgcctccaaa atggaggatt ctgtctccgc 180 tccagctgcc catctcatac caaactacaa ggaacatgta aaccagataa gcccaactgt 240 tgcaggagtt gacagtggtt tgaagaatgg acataaagga caagcaaggg attgtaaaat 300 tagtgtttta at 312 148 3822 DNA Rattus rattus 148 cttcactggg ggcccttagg cgagaggagt ttccaaattg ggtaaaaggc agagtggagg 60 aggggaggtg ataattagca aagttgtaga cctctgaacc ttctgggtct gaagcccctc 120 cctgtgagcg tgggggagac tcactctccg gtgggggggc cgtttgggtc ccccccaccc 180 ctactccctc gctcctttac accccgggct ctctcctggc ctcctacccc tgcaccctgc 240 atccatcatg acggtgatgt caggggagaa tgcagacgag gcttcggccg ctccaggtca 300 cccccaggat ggcagctacc caaggcaggc ggaccacgac gaccacgaat gctgcgagcg 360 cgtggtgatc aacatctccg ggctgcgctt cgagacgcag ctcaagactc tggcccagtt 420 ccccaacacg ctgctgggca acccgaagaa acgcatgcgc tactttgacc ctctgaggaa 480 tgagtacttc tttgaccgca accggcccag cttcgatgcc atcctttatt actaccagtc 540 gggggggcgc ctgcgcaggc cggtcaacgt gcccctggac atgttctccg aggagattaa 600 attttacgag ttgggcgagg aggccatgga gaagttccgg gaagatgagg gcttcatcaa 660 ggaagaggag cgccccctac ccgagaagga gtaccagcgc caggtgtggc tgctctttga 720 gtatccggag agctcaggac ctgcacgggt tattgccatt gtatccgtca tggtcatcct 780 catctccata gtcatctttt gcctggagac tctccctgag ctgaaggatg acaaggactt 840 cacgggcacc attcaccgca tcgataacac cacagtcatc tacacttcta acatcttcac 900 agaccctttc ttcattgtgg aaaccttgtg tatcatctgg ttctcttttg agctggtggt 960 gcgcttcttc gcctgcccca gcaagacaga cttctttaag aacatcatga acttcatcga 1020 cattgtggcc atcatccctt atttcattac cctgggcaca gagatagctg agcaggaggg 1080 gaatcagaag ggcgagcagg ccacttccct ggccatcctc agggtcatcc gcttggtaag 1140 ggtgttcaga atcttcaaac tctcccgcca ctccaagggc cttcagatcc tgggccagac 1200 cctcaaagct agtatgaggg agttagggct gctcatcttt ttcctcttca ttggcgtcat 1260 actgttttct agtgcagtgt actttgcgga ggcggaagaa gctgagtcgc acttctccag 1320 tatccccgat gctttctggt gggcggtggt gtccatgacc actgtgggat acggtgacat 1380 gtaccctgtg acaattggag gcaagatcgt gggctccttg tgtgccatcg ctggtgtgct 1440 gacaattgcc ctgcccgtac ctgtcattgt gtccaatttc aactatttct accaccgaga 1500 aactgagggg gaagagcagg ctcagttgct ccatgttagt tctcctaact tagcctctga 1560 cagtgacctc agccgccgca gctcctctac tatcagcaag tctgagtaca tggagatcga 1620 agaggacatg aacaatagca tagcccacta caggcaggct aatatcagaa ctggtaactg 1680 caccgcaact gatcaaaact gcgttaataa gagcaagctc ctgaccgatg tttaaaaaaa 1740 gcaccaggca agcaatcaaa agcccccaaa caaaaccctt ggcgactcct gtcccactct 1800 gtagatactt tactaaaacc gtagtctttg aatgctttat ttaactggca atgcactgtt 1860 gcattgtgaa tttggggggt gggcaaacct gaagctttca agatcacatt taaaaaacaa 1920 aaccaaccaa acaagcaaaa agaaaaaaaa aacccaacaa aaaatataaa aactcaaccc 1980 aacccaataa ccaactattt tcatttttat ttaaaaaatg agaaaagaaa gaggattttc 2040 taaaacgctg cccatgaagt agtctgtgtg aaataagact catgctttcc ttgtactgaa 2100 gtttttccaa tcttttggct taagttgttt tgtttgtttt ttttaaacct aaaaatcaga 2160 tgaccactta ggaacataaa aattcaaatt tgcatggaac tccactgtaa aatttttgca 2220 aattgcacag cacatgtcag atagtgtgcc ccgtggaaca ccatgtaaca gcctcagtgg 2280 tcagtggggg gaaaaatgct tttattttga tcaactgaat tgcatacaag gctaacaaaa 2340 tccggactca ttaagaatgg ttcagaaagc accttgcaaa tctgttactg gtcccaatct 2400 gttgggattt tccatctgcc ccgttctcct aaatcccagt ctattctcta agaaaagggc 2460 aacttgatta aatgagttgt ttcatctgta aggctgctaa gttctctcaa ctgcagatga 2520 tccaaatata ggtttgtttt ttttttaaac caatcctgac ccctgacctt cagaagtgga 2580 tgataaacct taccctcctt attgcaagag cacaagagtt caatggtaag catgtttgaa 2640 tccgataaca tttattttat aatcgcatgc tgagaaagtt agcccagaca atagtgaata 2700 agcttacgtt gaaatcgact cttctaaata tagtccgttt catttgcatt caccaaaagt 2760 gcactccttc atttattaac tcttttctta gcagctaaag tactgtattt aagtacgtac 2820 cttagatggg gacagtccct tttccgagct caaagcatgt tctcttagtc agcattatgg 2880 cctatttgat taagatatac cttgaattaa ttaatgcatg gtttcagtaa taaaaaaaat 2940 tagaaaatac taaaaattac aagcctgtgg gacgaaaggc caaaggacac gggggtgggg 3000 ggtggggtgg gggaactccg tcattttcct gcctttgctc agggaaatgt caagcttcta 3060 tgcaggtata gacagagaga ggaccaatat gcccatcctt taaggggaaa ctgtggaaaa 3120 ctaaataaat cattcaaggt atttaataga cctaaaacca agcattcttt ctagctgaac 3180 ataaatacaa gcaaaacaaa caaacaaaca aacaaaaaaa aggtgcaata ttgcatggtt 3240 tcttggtgca ttcttaggat gtaagtgata acgctgacct cttcatgcat ccagagcaga 3300 gccgatttct tttcgcagtc atgatttgaa gtctatagag acttcggccc tcccccctga 3360 ggctccctga agaaactcag ccaattgatt taatacttgc ttagtgcctt tatctgtacc 3420 cacagtgaac tgcagaaaag tgcctccata actcagctgg gaagttattt aacagaaggg 3480 aggaagggtt ggggcacaga cctttttgct ttttgttttg tttgtttttc catcctcact 3540 gtctcacttc accactgtga gaagacctct ccaccctcag agcccccaaa gaagagagag 3600 agagagagag aaagcaggtg ctgtctctct tggctgtcta ctggacttgg tctctttggc 3660 agcctgactc tggatatgaa ctgagaccca tctttgaagt ggacatgaac cataaactgg 3720 ttctattctg ttttgttctg ttctgttttg tttcttctcg accagaagcc aagagaaatg 3780 tttttgggaa tgtggaaggc cactccggac atacaaagct tc 3822 149 543 DNA Rattus norvegicus 149 ttctgtatga aataatttat tgtagcattg tcaagattgg cattatttta cagtattttt 60 ttttctcttc aagaactacg agtctaaaga aataaaggaa aactacttta attagaacta 120 tctaaataaa atcttctgtt ttggtttata tcagatagat ttacagacat attgtcactg 180 agaaatagag tgattccatt atataaaata tggcaaaaag ggtcccccca aatactgttc 240 aacaacacta tggtttaata gtttaattat agtttaaatt ctcatctgag aaacctaaaa 300 atgtactgaa tggcttgtgt ggggacagtg ctgtgtttta atttgtactt tgcccaatca 360 tcctcccttg aagaaaactc agggtaagct acttgctaaa ctctctaagt aactcaatca 420 agaaaacaca attgctattc aataaaaaaa aaatccaatt taagaaaaga aaaagaaaac 480 aattcctcta caatagtctg taagaccaga atagatacac aacaaattta atggttaaat 540 tta 543 150 410 DNA Rattus sp. 150 agaacaacaa atcaaaatgt aaacttaaaa tataaccaaa agagggacag ctctttagga 60 aaaggaaaaa accttaaata gtgaataaac aactacaacc acttaaccat tgtaggctta 120 aaagcagcca tcaataaaga aagcgttcaa gctcaacata catacttaca cacactaatt 180 ccacaaacct caataaattc ctatattaca aattgggcta atctatagac ccatagatga 240 aatactgtta atatgagtaa caagaaccaa ttctcctagc acaagtgtat gacaacccgg 300 ataaccattg tcaattatcg aatcataggt actaacccaa caataaaatt acctatccct 360 aactcgttag cccaacacag gcgtgcttta aggaaagatt aacaaaaaaa 410 151 1214 DNA Rattus norvegicus 151 gaattccggg ccccgcgctg ccgctgctcc tgccgtcgct gctcttgctg ctgctgttgg 60 gcgcgggcgg ttgcggtcct ggggtgcgcg ccgaggtgct gttccgctgc ccaccctgca 120 cgcccgagcg tctggccgcc tgcggacccc cacccgacgc gccctgcgcc gagctggtgc 180 gagagcccgg ctgcggttgc tgctccgtgt gcgcacgaca ggagggcgaa gcttgcggcg 240 tctacatccc gcgctgcgcc cagacgttac gctgttaccc caacccgggc tccgagctgc 300 ccctgaaggc actggtcacc ggcgcgggta cctgtgaaaa gagacgcgtg ggcgccaccc 360 cacagcaggt tgcagacagt gaggatgacc actcggaggg aggcctggtg gagaaccatg 420 tggacggaac catgaacatg ttgggaggca gcagtgctgg ccggaagccc cctaagtcag 480 gcatgaagga actggctgtg ttccgggaga aggtcaacga gcagcaccgg cagatgggca 540 aaggtgccaa acacctcagc ctggaggagc ccaagaagct gcgcccacct cctgccagga 600 ccccttgcca gcaggagctg gaccaggtcc tggagcgcat ctccaccatg cgccttccgg 660 atgatcgggg tcctctggaa catctctact ccctgcatat ccccaactgt gacaagcatg 720 gcctgtacaa cctcaaacag tgcaagatgt ctctgaatgg acagcgtggg gagtgctggt 780 gtgtgaaccc caatactggg aagccaatcc agggagctcc caccatccgg ggagaccccg 840 agtgccatct cttctacaac gagcagcagg agaatgatgg ggctcacgcc caaagggtgc 900 agtaaaccac agccagtcgg tgcctggctt ccccacccca aacaccagca gaaatggagg 960 gtgtcagggt gatgggtgtg gaggatttcc cagttttgac acatgtattt atatttggaa 1020 agagaccaac actgagctca gaagcccccc tgcgcccccc agtggcagtt aacctgtacc 1080 tccgttcctg cttctaatag agagggtggt ggcactgggg atactgggta caggcttggg 1140 aatggggaaa gaaattttta tttttgaacc cctgtgtctc ttttacttaa gattaaagga 1200 aggaaacgga attc 1214 152 3201 DNA Rattus norvegicus 152 ccgggtaaga aaataagctg ccctattttt ctttcttctt ctcttacaac tggaaccagc 60 catttcccca aactaccacc atggaggtgg caatggtgag tgccgagagc tcagggtgca 120 acagccacat gccttatggt tatgctgccc aggccagggc tcgagagagg gagagacttg 180 ctcactccag ggcagctgca gctctggctg ttgcagctgc cacggctgcg gtggaaggca 240 ctggaggttc tggtggaggc ccccaccatc atcatcagac acgtggggcc tactcctccc 300 atgatcctca aggaagccga ggtagtcgga ggaggaggcg acagcgaact gagaagaaga 360 aactccacca caggcagagc agttttcctc attgctcaga cctgatgccc agtggctctg 420 aagagaagat ccttagggag ctgagcgagg aggaggaaga cgaggaggag gaagaggagg 480 aggaagagga gggaaggttt tactatagtg aagaggacca tggggatggg tgttcctaca 540 ctgacctact gccacaggat gatgggggtg gcggcggcta cagttcagtc cgctacagtg 600 actgttgtga acgcgtggta ataaatgtgt ctggtctacg cttcgaaacc caaatgaaaa 660 ctttggctca gtttccagaa actctgttgg gagaccctga gaagaggact cagtacttcg 720 accctttgcg caatgagtat ttttttgata ggaaccgtcc cagctttgat gccattttgt 780 attattacca gtcaggaggc cgcctgaaga ggccagtcaa tgtccccttt gatatcttca 840 ctgaggaggt gaagttctat cagttgggag aggaagccct gctcaagttc cgtgaggatg 900 agggctttgt gagagaagag gaggacaggg ctctgccaga aaatgaattt aaaaaacaga 960 tttggcttct ctttgaatat cccgagagtt ccagccctgc cagggcgata gccatcgtat 1020 ctgtcctggt catcttaatc tctattgtca tattttgcct ggaaaccttg cctgagttca 1080 gggatgatag ggacctcatc atggccctca gcgcaggtgg acacagcaga ttattgaatg 1140 acacctcggc accccacctg gagaactcag ggcacacaat attcaatgac cctttcttca 1200 ttgtggagac agtatgtatc gtgtggtttt cctttgagtt tgtggttcga tgctttgctt 1260 gtcccagtca agcactcttc ttcaaaaaca tcatgaacat cattgatatc gtctccattt 1320 tgccttactt catcactctg ggcaccgatc tggcccagca gcaggggggt ggcaacggcc 1380 agcagcagca ggctatgtcc tttgccatcc tcaggatcat ccgtctggtc cgagtgttcc 1440 ggatcttcaa gctctccaga cactccaagg gcctgcagat cctgggccac accctaagag 1500 ccagcatgcg tgaactgggc cttcttatct ttttcctctt catcggggtc atcctctttt 1560 ccagcgctgt gtattttgca gaggcagatg aacctaccac ccatttccaa agcattccag 1620 atgcgttttg gtgggctgtg gtaaccatga caactgtggg ctacggggac atgaagccca 1680 tcacagtggg aggaaagatt gtggggtccc tgtgtgccat tgcgggtgtc ttaaccattg 1740 ctttgcccgt gccggtgatt gtgtctaact ttaactattt ctaccacaga gagactgaaa 1800 acgaagaaca gacccagctg acccaaaacg cagtcagttg cccataccta ccttctaatt 1860 tgctcaagaa atttcggagc tctacttctt cttccctggg ggacaagtca gagtatctag 1920 agatggaaga aggggtcaag gagtctttat gtggaaagga agagaagtgt cagggaaagg 1980 gggatgacag cgagacagat aaaaacaact gttctaatgc aaaggctgtg gagactgatg 2040 tgtgaatctc tttccccacc tgccgtgccg ccgcccagct ccgaatatat tcatacataa 2100 agaatgcagt tatgaaaatg agatatgcac tgcatacagt aatacactgc ttaatggcga 2160 tacatggcat aattgtggcg aaacgtgtat tgcatatcaa ataagtgatg catcttggag 2220 aagagggagg cattaaaaac agcagatcta tctttatatt ttttaataga atgcaagaat 2280 tttgcacata atgggaaaat gttaatagta aaggtggtcc cgaggagagt gagtgtgtgt 2340 gagagagtga gagagtgtgt ggccatggga gtgtaagtaa attgtcaaca ttgttgggaa 2400 ttgtgccgtg atgggaaaag ttggcattct gaagtattta ctatgtaaga actaatgaac 2460 ttgagcagtc ttttaccagt gttttaataa catctcctat gtctttggat tctgtagttg 2520 ttttctagaa attgtaagaa ttactgtgta gaaaaaagag aaagtaaatt atttaatagt 2580 atataggtca caatttaatc ttggatttaa ttaaagttta tttttaactg gaaattaact 2640 tttgaaaagg ctgcagggcc ttagaaattg attatatttt gttattaatt ttgggagata 2700 tactagcaaa tgcctaatgt tctggaggaa atgtaacaag ttttgttcac aggtcttaag 2760 actggaattt ttttttcttt tgcactactt tctatgctga agcccgagag agacttcata 2820 ctgtgaatgt ttactaacgc accaatcagt tcaatgacaa tcattggaag aatggtttct 2880 tcgtctcatt tattgttctt ttcattttgt gagactaatg agcacacaga taacagcaca 2940 cgattcctgc tttaaaatct gaacaaccga tctacaaagg gactacgaag taacgttcag 3000 cagccgaatc tttcaaaatt ggtttgttac aatgatgctt cagaaaccat actattttca 3060 atactcttct gccttttaag tccagaataa tttaaccaaa gttattgcat gcacagaaag 3120 aattccggca ttttgttgca tatttaataa aaagatctta agccataatt gctgtagctg 3180 ctgggcgcct tatttctcca a 3201 

What is claimed is:
 1. A method for identifying a compound to treat a neuropsychiatric disorder, which method comprises: (a) contacting a cell with a test compound; (b) determining expression by the cell of one or more signature genes, each said signature gene comprising a nucleic acid that hybridizes to a nucleic acid selected from the group consisting of SEQ ID NOS:1-154 and the complements thereof; and (c) comparing the determined expression of the one or more signature genes to expression in a cell not contacted with the test compound, wherein changes in expression of the one or more signature genes consistent with a therapeutic effect indicate that the test compound is useful for treating the neuropsychiatric disorder.
 2. The method according to claim 1 in which the neuropsychiatric disorder is selected from the group consisting of, schizophrenia, autism, major depressive disorder (MDD), bipolar affective disorder (BAD), schizophrenia and psychotic depression.
 3. The method according to claim 1 in which the cell is a neuronal cell.
 4. The method according to claim 1, wherein changes in expression of signature genes which are similar to changes observed in ECS indicate that the test compound is useful for treating the disease or disorder.
 5. The method according to claim 1, wherein changes in expression of signature genes which are similar to changes observed in ECT indicate that the test compound is useful for treating the disease or disorder.
 6. The method according to claim 1, in which changes in the expression of signature genes are evaluated from a value (V) comprising the sum of each signature gene's change in expression.
 7. The method according to claim 6, in which said value (V) is determined from the normalized change (E_(i)) in expression of each efficacy gene (i) weighted by the score value (ω_(i)) according to the relation: $V = {\sum\limits_{i}{\omega_{i}{E_{i}.}}}$


8. A method for selecting one or more signature genes that are indicative of an effective therapy for treating a neuropsychiatric disorder, which method comprises identifying nucleic acids that are differentially expressed in an individual subjected to electroconvulsive seizure (ECS) compared to an individual not subjected to ECS.
 9. The method according to claim 8 in which the neuropsychiatric disorder is selected from the group consisting of, schizophrenia, autism, major depressive disorder (MDD), bipolar affective disorder (BAD), schizophrenia and psychotic depression.
 10. The method according to claim 8 wherein the individual is subjected to acute ECS.
 11. The method according to claim 8 wherein the individual is subject to chronic ECS.
 12. The method according to claim 8, wherein nucleic acids are identified that are differentially expressed in the hippocampus of an individual subjected to ECS compared to expression in the hippocampus of an individual not subjected to ECS.
 13. The method according to claim 8, wherein nucleic acids are identified that are differentially expressed in the frontal cortex of an individual subjected to ECS compared to expression in the frontal cortex of an individual not subjected to ECS.
 14. The method according to claim 8, wherein: (a) a score value is obtained for each of the identified nucleic acids, the score value for each gene being a function of each gene's differential expression in individuals subjected to ECS, and (b) signature genes are selected which have the highest score value.
 15. The method according to claim 8, wherein nucleic acids that are differentially expressed comprise one or more nucleic acids that hybridize to a nucleic acid selected from the group consisting of SEQ ID NOS:1-54 or to a complement thereof.
 16. A kit for detecting an ECS gene signature, wherein said kit comprises a plurality of oligonucleotides, each of which is capable of specifically hybridizing to a different ECS signature gene.
 17. A kit according to claim 16, wherein each ECS signature gene is a nucleic acid having a nucleotide sequence selected from the group consisting of SEQ ID NOS:1-152, a complement thereof or a homolog thereof.
 18. A kit according to claim 17, wherein the ECS signature genes comprise at least one homolog of a sequence selected from the group consisting of SEQ ID NOS:1-152, or a complement thereof.
 19. A kit according to claim 18, wherein said homolog specifically hybridizes from a nucleic acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOS:1-152 and their complementary sequences.
 20. A kit according to claim 18, wherein said homolog comprises a nucleotide sequence that is at least 80% identical to a nucleotide sequence selected from the group consisting of SEQ ID NOS:1-52 and their complementary sequences.
 21. A kit according to claim 16, wherein each of the plurality of oligonucleotides is immobilized on a solid surface or support.
 22. A kit according to claim 21, wherein each oligonucleotide is immobilized at a known position on the solid surface or support.
 23. A kit according to claim 16, wherein at least some oligonucleotides in the plurality of oligonucleotides are capable of priming reverse transcription of an ECS signature gene.
 24. A kit according to claim 23, further comprising a polymerase and nucleotide bases.
 25. A kit according to claim 24 wherein the nucleotide bases are detectably labeled.
 26. A kit according to claim 16 comprising oligonucleotides capable of specifically hybridizing to at least 10 different ECS signature genes.
 27. A kit according to claim 16 comprising oligonucleotides capable of specifically hybridizing to at least 50 different ECS signature genes.
 28. A kit according to claim 16 comprising oligonucleotides capable of specifically hybridizing to at least 100 different ECS signature genes.
 29. A kit according to claim 16 comprising oligonucleotides capable of specifically hybridizing to at least 150 different ECS signature genes. 